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AN APPLICATION OF FACTORIAL ANALYSIS 
TO THE STUDY OF TEST ITEMS 


By P. E. VERNON 


Institute of Education, University of London 


L Introduction. П. Data and Methods. MY. Analysis of Items. IV. Analysis of 
Тиги V. Functional Content of Item Factors. V1. Functional Content of Separate 
tems. VII. Final Analysis of Formal Factors. УЙШ. Conclusions Regarding Factor 


Analysis. IX. Summary. 


I. INTRODUCTION | 


In an earlier article (Vernon, 25), it was suggested that factor analysis could be 
useful in analysing the homogeneity of content of test items, though the amount of 
Work it involves limits its application to short tests. The present study was designed 
to discover how valuable would be the information obtainable about the items of a 
test of moderate length, namely, Progressive Matrices. This test was chosen, not with 
any intent to criticize, but because : 

(a) it is so widely used that fuller understanding of what it measures is desirable ; 
21 (b) answer sheets for the 20-minute version from 640 recruits, together with scores on 

other tests, happened to be available ; 

(c) the content of the test appears, 
Freee than that of many other popula 
ae if appreciable heterogeneity is foun 

Current methods of test construction. З " 

Several researches have in fact indicated that the items of which certain tests 
are composed do measure diverse abilities. Burt(1), Wright (29), Burt and John (3), 
and McNemar (12) have shown that the Stanford-Binet or Terman-Merrill scales 
embody several group factors over and above their general content. In McNemar’s 
a analyses at different age levels the mean first factor variance was close to 40 per 
mr and the mean variance of his bipolars 13 per cent. But the latter were exagger- 
(d ed by unreliable correlations from small groups, and amounted only to 10:5 per 

€nt. in analyses where 200 cases were available. 1 xb. 
h The relative sizes of general and bipolar variances naturally- depend "on the 
eterogeneity of the subjects tested. But one would suggest that, іп a representative - 
a the former should usually exceed the latter by at least four times. а That this 
^UE moderate demand is shown by Eysenck and Crown's (5) analysis of a 24-item 
attitude scale, There the general and bipolar factor variances were 48 per cent. and 
H pa cent. Another research on the Stanford-Binet test, by Wright, yielded variances 
60:8 per cent. and 17-3 per cent. among ten-year olds. But as the last two of 


right's six bipolars were almost certainly non-significant, it would be fairer to take 
The same criterion 


account only of the 12-9 i f the four chief ones 
“9 per cent. variance of the four caie я 
may be applied to the sub-tests of a battery. For example, the Army Summed 5.6. 
tive group of recruits 


1s be , 
based on scores from five tests. When analysed in a representa 0 
and three bipolars contribute 10:4 per 


the first f; > : 
actor E t 66:6, 
ame E 1, we should be much more doubtful 


from inspection, to be, if anything, more homo- 
intelligence, attainment, and aptitude tests. 
d, we might need to reconsider the soundness 
* Е 


‚ Cent. Had the ratio only reached, say, 3 to 


abo Е 
ut employing а summed score. 


^ ; \ 


An Application of Factorial Analysis 


An example of heterogeneity is provided by Guilford's (8) analysis of the ten sets 
of items in the Seashore Pitch Discrimination test, given to 300 students. He concludes 
that the more difficult sets (5 to 0-5 cycles) depend on entirely different factors from the 
easier sets (30 to 8 cycles). In spite of the test's apparently homogeneous material, it does 
not measure one and the same pitch ability throughout. An alternative possibility is that 
more able subjects use very different discriminatory methods from the less able. However 
Guilford notes that the two most difficult sets (1 and 0-5 cycles) are highly unreliable ; and 
if these are omitted the first factor variance of the remainder is 57:8 per cent., bipolar 
variance 15:8 per cent. Indeed, with a less selected group (for Pitch Discrimination 
correlates appreciably with intelligence), the general factor in these sets might easily have 
satisfied our suggested criterion. 


There is other evidence of ‘ difficulty factors.’ Burt and John, in their analysis of 
Stanford-Binet X- and XII-yr. items, obtained a factor separating almost all the former 
from the latter items. Hertzman (10) studied correlations between the easier and more 
difficult halves of a battery of group-tests, and concluded that they measure somewhat 
different abilities. Ferguson (6) points out that spurious factors corresponding to the 
difficulty of different sets of items will be introduced if Pearsonian product-moment OF 
point inter-correlations are analysed. Wherry and Gaylord (28) recommend substituting 
tetrachoric correlations, which are unaffected by item-difficulty ; but in point of fact this 


had already been done both by Burt and Guilford in the researches just mentioned. 
Another possibility is that the form of the items (open-type, analogies, classifica- 


tion, etc.) introduces unwanted factors. Guilford (7) claims that this is unimportant, 
and cites Smith's (18) investiga*ion, where 14 verbal, numerical, and spatial tests In 
analogies, classification} and open-response form were compared. Although Smith's 
application of tetrad analysis failed to reveal any clear-cut formal factors, a ге- 
analysis of his figures by the present writer, using group-factor technique, suggests 
the presence of a group factor with some 7 per cent. variance among multiple-choice 
tests (analogies and classifications) not present in open-response tests. The total 
variance of the factors based on the content of the tests among college students 
approximated 50 per cent. Most educational psychologists seem to regard an 
individual Stanford or Terman-Merrill І.О. as having greater practical utility than 
ап І.О. from an equally reliable group-test, and this may be because the latter 15 
distorted by some such formal factor, i.e., by the ability to do multiple-choice tests, 
which has been described by the writer elsewhere as ° test'sophistication ° (Vernon, ^7 


Thurstone (22) and Shaefer (15) have suggested that the perceptual speed factor, 
P, appears only among tests composed of easy items. Though it was present E 
tests of matching or identifying printed letters and numbers, diagrams and picture 
among Thurstone's (21) original college students, it failed to emerge consistently 
when similar tests were applied to schoolchildren, to whom such tests offered greater 
difficulty. It seems possible, again, that the word fluency factors, W ОГ 7, an 
differentiated from the yerbal factor, V, largely by the ease of the verbal items involve D 
since several factorial investigations have failed to establish a clear separation betwee 
them. This is bound up with the whole question of the existence and nature 0 
speed factor in intelligence and other mental tests, For if such a factor is promine di 
it will presumably affect the performance of less able testees on an ordinary time- m 
test more than it affects the more able ; and it may affect the later items of such a (5 
more than the earlier ones. In spite of the rejection of such a factor by Spearmo 
Nodes. ү EE MM of Slater (16), Davidson and Carroll (4) and Tate [o 

proved beyond doubt that т ig un 
appropriate E оноп: PME I Work gan be differentiated 


It might be thought that internal an i i o 
1 1 ; alyses which reveal heterogeneity 47! 
fuer can be disregarded, provided that scores on the test as a wile give useful 
predictions of some external criterion of educational or vocational success- put 


2 


usuz ai А i 
з ally obtains small spatial or k-loadings 
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15 i : р 
dictions Ду pru jt M gee асе during the war of irregular pre- 
а nt test-score lev i 
Bout lem els that the present writer has become concerned 
for pue ашау ор 600 naval radar operators yielded product-moment correlations of -26 and :35 
failures a ye Matrices and Arithmetic tests with training marks. But when the 4 per cent. of 
respectively, sh per cent. of passes were contrasted, the tetrachoric coefficients were +30 and —:13 
test, or by д fe EDU that regressions were non-linear. The abilities measured by the Arithmetic 
ic RETS erion marks, must have altered at the bottom end of the range. Similarly the 
orse trainine era based on discrimination of Morse patterns, gave moderate correlations with 
extremely ш resul Б among telegraphists when product-moment correlations were used, but was 
24) workell X 2t picking out failures. In contrast the Morse Learning test (described by Vernon, 
the signifie etter at the lower than at the upper ranges. There was seldom time to investigate 
ance of non-linearity, but it was definitely proven in one comparison, by Dr. Fraser 


obe У д 
с 91 (а Matrices test with the naval T2 battery. 
arly there is room for much more detailed analysis of what our tests measure, for determining 


the ex A 4 
Heeroma О Which all items depend on the same factor or combination of factors, and whether 
Th city of content or form is responsible for variations in predictive value. 
e Progressive Matrices test? was originally devised as a measure of g, but it 
, among adults, as shown by the writer 


elsewhere (24, 26). Its five sets of 12 items employ similar material, but differ slightly 


in their ; 
their approach. Raven (13) describes them as : 


ig Continuous patterns, 
el Analogies between pairs of figures, 
d) pLOBNISSIVS alterations of figures, 
(e) Re lions of figures, 
Бас} solution of figures into constituent parts. 
and Pis Starts with very easy items, so that separate instructions are not required, 
АБЧ finishes with moderately, or very, difficult items. Only one study—to the 
e S knowledge—has been made of the separate sets. , Rimoldi (14) factorized 
Era, And 14 other tests, which had been given to 138 children of mixed sex, and 
Б eese 8 to 15 years. Seven factors were extracted and rotated, and as might 
iB rected the results are merely confusing. However, the unrotated factors 
ipol. © соттоп variance іп the Matrices sets of some 55 per cent., together with 
* ars, which differ in the sets, of some 10 percent. 1 | 
the ee an ( 19) and others have suggested that different subjects might answer 
E by different methods, for example, synthetically or analytically. It seems 
Y also that some would resort to spatial imagery, and others might employ logical 


Verba “i g a 
pal analysis. Heterogeneity of method in the persons would, on Burts 
of item-content in the tests. 


ТЕС а j 
кошу. principle, presumably imply heterogeneity of 
emsel ecided in the present study both to analyse the items (not the sets) among 
Of wi il Ves, and to compare each item with other tests which would provide measures 
cll-established factors—g, v, л, k, mechanical information and perceptual speed. 


e 


II. DATA AND METHODS 


апа" ће following battery of tests was assembled by Science 4, the Air Ministry, 

all а реда у in the summer of 1948 to 640 Манова mper 
Close to 18 -nowledgements are due for рег 

ete ry RD m found in Vernon and Parry (27). 


Uset 
he results. Fuller descriptions of the tests may 
nce been claimed 


1 

ERE orio: ^ 
that (4 HBinal 1938 version wa d in this investigation. It has, however, Sit 
items пе later version, 1947, Set 1L already meets many of the weaknesses dece ute 
of difficuli anged in groups of four, instead of twelve, according (a) іо ANNIS investigations of 
adult intelligent P i be preferable for this test to be used in future largi 
AI 3 


An Application of Factorial Analysis 


Arithmetic А. Short problems. 

Calcs. A and Calcs. С. Four rules and mechanical arithmetic. 

V-4. Stephenson's mixed vocabulary, English, and reading comprehension test. 

V-V. A new vocabulary and comprehension test. 

Gen A-1. Crossing out wrongly spelled words. 

Gen A-2. Comprehension of paragraphs. , 

SP 14. A.T.S. Spelling ; identifying the correct spellings from several misspelled versions. 

SP 21.. Army Instructions or Clerical test. s 

SP 119. Giving the readings of pointers on scales and dials, and the co-ordinates of points 
on rectangular and polar graphs. 

Ins-A. Reading aeroplane panel dials. 

Obs-C. Matching aircraft silhouettes. 

Progressive Matrices. Raven's 1938 test, with 20-minute time limit. 

G-5. Stephenson's Matrices test, modified, with multiple-choice responses. 

K-6. Stephenson's spatial test, including paper formboard, copying figures onto dotted space, 
and drawing mirror images. 

SP 4. National Institute of Industrial Psychology's spatial judgment test, Squares. 

Group Test 80. N.LL.P. test of recognition of shapes turned round or over. 

SP 122. Army revision of Bennett Mechanical Comprehension. 

Mec-B. Diagrams of machines, comprehension questions. 

Mec-C. Multiple-choice mechanical information questions. e 

SP 117 E and M. Questions on everyday electrical and mechanical matters, open-response. 


- Table I gives the 20-minute Matrix norms believed to be correct for 18-year 
males in the general population, and the distribution in our sample. 


TABLE L PROGRESSIVE MATRICES DISTRIBUTIONS 


Score | 18-yr. Population R.A.F. Sample 
Per cent. Per cent. 
50+ 555 b. zo 10 178 
46+ Bs ae X 20 25-7 
42+ dc eae 20. 256 
36-- И 5. 20- 21:6 
284 T5 ET жү, 20 8:7 
27 and under ... d 10 s 


оог o Eo OOO Na 3 
T 


Unfortunately the sample is considerably above average ; and, though it 1S. 28 
heterogeneous as those normally encountered іп educational or vocational investiga" 
tions outside the Army, the Standard Deviation of scores is only 618. In more 
representative Army groups the Standard Deviation is about 8:50. Not only does 
this reduce the common-factor variance, but it also means that few recruits fail the 


casieritems. Actually in 30 items out of 60 the number failing amounted to less than 
10 per cent. 
"Tetrachoric correlations can still be obtained between i i igh) failui 
a een items with very low (or very high) a 
rates, but they are extremely unreliable. Thus the S.E. of zero r, OR Kelley’s form 
(11, No. 211) is 062 when both items are passed by about 50 per cent. of a group of 640... оңу 
for the correlation between items C 1 (p = +970) and E 12 (p = +041), it rises to -223. This асоту 
will always arise in а test covering a wide range of ability. It might be met by taking an abnormi ng 
heterogeneous population, or, following McNemar, by making several overlapping analyses атоо 
low grade recent populations: However most tests are not, like Matrices, inten d 
en es from 8 up to superior adult, and their i n i 
ind 30 per cnt difficulty, > an eir items do normally lie between р » 
e 18 easiest items were omitted altogether ; and the p values (difficulties of the remain? 
42 appear in Table II, Col. 2. The S.E. of zero r between fach we and s vee whose P = 50 
is given in Col. 3. For any other pair of items, the S.E. of zero r is given by the product of th H 
е: multiplied py 161. non шше, Clwih E12: 123 x 112x161 = 2232 ae 
ive difficulties parallel very closely those quoted by Slat rmy recru? 
the rank order correlation between them being 046. 8 a exoup GST Ашу 


4 


P. E. VERNON 


Scores o 22 i 
es on the 22 tests were split at the median. It might be preferable to correlate each Matrices 
would have enormously increased 


item wi 4 

the NAVI the corresponding difficulty level, but this 

at every level xd e en themselves would then have had to be inter-correlated and factorized 

single level for the А, jest at several different difficulty levels. It appeared better to maintain a 
е medium, but еа n order to provide a stable frame of reference factors with which not only 

Tabulation of tt пе very easy and very difficult, items could be compared. 

Punching was not he 2,016 correlations between 42 items and 22 tests was done by hand, since 

SEs, and in ord readily available. Tetrachorics were read off from graphs. In view of the large 

of the subse er to save time in what was intended solely as an exploratory investigation, most 

quent calculations were done to two decimal places only. ‘ 


Ill. ANALYSIS OF ITEMS 

s" of variables, such as test items 
lyses were therefore made of the 
est correlations. The first was 
factors extracted are shown in 
d after totalling the columns at 
the highest r in each column 


and oy ie unwise to factorize two * universe: 
inter-item ce a single matrix. Separate ana 
Carried out b Panon the inter-test and the item-t 
Cols. 4-8 г y the Centroid method, and the four 

of Table II. Communalities were guesse 


each stag 7 

ge; abet Wwe. 
Was used. neither successive approximation nor 
y assume (with McNemar) that 


It is diffic P. s 

a f; is difficult to evaluate significance, but if we ma 
n, then the figures in the third 
factof loadings. For the sth 


actor is : х : = 

column of Чер to a variable split at the media 
able II give us the S.E. of zero r for first- 

n . times larger. The 


fac 
5 о, the S.E., according to Burt and Banks (2) is M ET 
Titers raras e Ex. 
E Стане is to regard a factor as significant if half the loadings exceed 2xS.E. 
Criterion kx of 42 fourth-factor loadings reach this level. . On Guilford and Lacey's (9) 
id not ar ЗО; the fourth factor is close to the borderline of significance. Hence 1t 
Sommuncihe justifiable to extract a fifth or further factors. Nevertheless the total 
t ia ity obtained, 39-7 per cent., is a conservative estimate ; and it js likely 
More efficient techniques of analysis, such as Maximum Likelihood, might 


N 
aise о perhaps 42 per cent. | 
аттен Шуй is, by itself, not very informative. Most of the item АЕ 
Ipolars e can be accounted for by Factor I, with 26-0 per cent. variance, ut 0 8 

Positive "Ver another 13-7 per cent. If rotation was attempted to factors with 
depend al factor would largely disappear, and 
ent factors corresponding quite closely to Raven’s sets of items (44-5, С, 
factorists would almost certainly 


> and E) w Е 
"оша be obtai st American ; 
ained. Mo However the heterogeneity 


Sonclug 
Not re, that each set is measuring a different ability. 1 у 
М the first being the attenuation of Factor 


ally S T 
the O serious, for two reasons 
оте mogeneity of the population. "The total variance of 39-7 per cent. p 
With Stoney Kelley’s formula (11, No. 178) to the value expected in a popu, 
Increase үү ard Deviation 8:50. It then becomes 68:2 Der cent., and 1 I meris 
тоге th ics [er in Factor I, the variance of this factor might reac » 
7 ouble the obtained value. Note that this figure is J) A 
Suggested oee the Matrices test appears to satisfy the criterion of ш 
theless it above (unless the bipolars also swelled in а representative group» how 
Telatively 15 Instructive to realize how small neral factor may be, ап o 
4 Sreat the heterogeneity, in a test app i ) ately selected por Bie 
а true Sly, it is probable that factors with all-positive oue. bipolar. 
Consiqer ше of item structure, and that at least one factor shee а 
2 ny bien of the answer sheets and item-correlations soon | g 
Jects spend so much time on the more difficult early items 
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have time to tackle Sets D and E, whereas others guess freely and have more time 
to score on the later sets. If so, some groups of items really are negatively correlated 
with others. Doubtless this phenomenon arises largely because the test is applied 
with a time limit, which was not intended by its author. The moralis clear: tests 
given with insufficient time should have the items arranged in order of difficulty 
throughout, and if the items fall into distinctive sets, they should be arranged in 
omnibus or cyclic form. Otherwise the total scores of some subjects will be based to 
too great an extent on some of the sets, and of other subjects on other sets. 


There is unfortunately no direct means of telling which bipolar originates in this manner. Order 
of difficulty gives some clue, although the subjective difficulty felt by the average testec does not 
necessarily correspond closely to the proportions of failures, 1—p. By rotating Factors IL and ш 
through tan“ -5, a bipolar is obtained whose loadings give a tetrachoric correlation of 73 with Р. 
This is the best approximation we can get to a difficulty factor. It is shown in Col. 16 of Table IL. 

We cannot decide, either, whether our factors derive from the form or the functional content © 
the items. Probably all of them are mixtures. As already pointed out, the bipolars differentiate 
Rayen’s sets ; but it may well be that these sets differ in function (e.g., dependence on k) as we 
аз їп form. Although Factor IV yields mixed positive and negative loadings for at least two Sed 
C and E, we cannot assume that this represents differentiation by function. It may merely separa 9 
the more from the less difficult items within these sets. Thus the six most difficult items In Set 
all have positive loadings and three of the four easier ones have negative. Similarly Items E qa 
are all negative. 


. How far do general and bipolar variances differ with difficulty ? Table IH 
gives the results for the easiest, medium, and most difficult items. 


€ 
TABLE Ш. FACTOR VARIANCES OF EASY AND DIFFICULT ITEMS 


Factor | 1 II ш IV | " 
| | 
22 easy items, p = -7 to 1-0 a: | 236 58 53 435, 11200 
11 medium items, p = -3 to -6 | 352 43 25 22 359 
9 difficult items, p = 0 to -2 | 5 9:8 35 21 
| | — 
| 
All items ee ast ар O0 63 41 33 39:7 


Apparently the medium ones have the greatest homogeneity and least bipolar Maher 
The easy and difficult ones both have higher than average bipolar content, but tà to 
low Factor I content is even more noticeable. The tendency for extreme items 
be weaker than medium ones (even when consistency or validity is determine 
tetrachoric or biserial correlation) was pointed out in the writer’s survey of it Пу 
analysis techniques. The figure for the most difficult Matrices items is spect” 
low because thcy include two items that are almost useless for adults, D кР Ha ой 

Any information about test items, based on general factor loadings; coul on 
course be obtained much more simply by the stock item-analysis techniques ile 
the whole, then, factor analysis of items by themselves hardly seems worth Уш 
provided that the internal consistency or reliability of the test is sufficiently high 
ensure the dominance of a general factor. The bipolars have told us nothing. GR 
that it is probably a mistake to give Matrices with a time limit. Conceivably ev 
would Бе more useful in the case of a test where their meaning could be more rea E 
identified by studying the content of the items they differentiate. Yet 0 
McNemar's Terman-Merrill analyses, interpretation was often difficult. . it 
important reason for this is that the bipolar loadings are inevitably low in reliability 
when based on tetrachoric item-correlations. 
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IV. ANALYSIS OF TESTS 


The inter-test correlations were analysed by grou) 
Ч : group-factor methods i 
ү от. авы: of the same battery (slightly abbreviated) having denied the 
ae n. ure of the tests (Vernon, 26). G loadings were chosen to reduce to 
co deines e correlations between sets of tests unlikely to have any group factor in 
Үд bo 057 Cales. А and C ; G-5 and Matrices ; Mec-C, 117 E and M. Similarly 
сев r " ‚ and SP 14 should be independent (apart from g) both of G-5 and 
as Obs-C T et K-6, SP 4, and Group Test 80. The g-loadings of other tests, such 
block DE pe Reading, were obtained by summing their correlations with the 
of this block E which they were least related, and dividing by the summed g-loadings 
ls T AT dmittedly there is an element of subjective judgment in this procedure, 
as guided both by the centroid analysis and by many previous analyses of most 


of Stee 
the tests in comparable populations. 


TABLE IV. FACTOR ANALYSIS OF TESTS 


Е. Ten FACTO MEM 
Test g ned. y Tech- к Per- he 
o. Ж nical ceptual 
ie А ш N A (es AO :836 
GE E е D2 ву 259 :956 
БАСС. ш, ш, с E uh CE 874 
У-у Verbal | 35 (3 4 E 801 
Gen Wosabulary and Comprehension... | -63 58 (13) -750 
en A.» ppeling we ... .67 (19 49 | +725 
SP 14 Spell E aPh Comprehension ... -70 -38 :634 
= ing. ш. с EODD CSE) oa (15) | :937 
D 
СЫ унше a. sa, шш 28 -8 | :928 
InjA ТУСШ and Graph Reading ... | 788 — (09) (19) 24 | -876 
sc M Reading .. —.. 7 2. | 20 (13) 26 | 895 
Eo silhouettes |... EN (er (19) (34) 22099] -481 
^. c9Bressive Matri - 
9-5 atric о, '27. "719 
ҘАР Make zo mo 7 39 (29 388 
-6 RA " 
р .F. Spati 1 64 975 
ro Squares Р ш 2 T 38 -448 
8 P Test 80 [ese (2) 53 702 662. 
Р 125 | 
Mec-p Ргасіса! Problem : 34 Саз) (28) | 725 
2 5 Bee ress sis 56 (18) E 
Месо Mechanical Comprehension | 34 (9 51 пу 2D) 2 
RE 117 ташса! Information ... | 62 (28 50 P 
Pun M TAL Information  . | 59 (28) 747 | «688 
lechanical Information ... | ‘50 (30 59 
60 2:3 771 
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ee arithmetic tests 


ji the thri 
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After 

gave a Temoving g, the promi 
Ormula n (number) loadings shown in Tal these three, and 

Sir z 5 Certain other tests had smaller po à h elementary 
ducar; Wations are shown in brackets. Clearly this : 7 ding, but 
the Verbal as a pure arithmetic factor. Spelling obtains a considerable Pons 
Similar] al comprehension tests V-V and Gen A-2 are free from it. Five te the five 
Y yielded the next, v, factor and this entered to some extent into 
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mechanical tests. Although this finding is unusual, it is quite plausible, since the 
mechanical tests obviously involve the understanding of printed material. 


The next factor probably represents mechanical information and technical educa- 
tion, since Arithmetic, Scale Reading, Obs-C and the k-tests all show small loadings. 
The three spatial tests have a prominent additional group factor, which is present in 
mechanical comprehension (SP 122 and Mec-B) though not in information, also in 
G-5 Matrices and Obs-C. Finally there is slight evidence to support Thurstone $ 
factor of perceptual speed in clerical and other tests which involve rapid matching or 
identification of verbal, numerical, or pictorial material. Actually its make-up 15 
more reasonable in this than in the previous, centroid analysis, and though its 
reliability is low, it is probably significant. By Burt and Banks' formula, with 
n=5 and s=3 (since g and 1 or 2 group factors have been extracted from its component 
Eu the S.E. of a zero loading is -081. Four of the five loadings exceed twice this 

gure. 


Table IV shows that g covers 48:9 per cent. of test variance and the five so 
factors together 28-2 per cent., total 77-1 per cent. Corresponding figures for t р 
centroid analysis were 43-8 per cent, 27-4 per cent, and 71-1 per cent., thoug 
comparisons are hardly legitimate both because the centroid analysis combine" 
together certain pairs of tests here analysed singly, and because it was base o 
product-moment coefficients. It may be noted that neither analysis gave any 
indication of formal factors, although the tests do include open-response and variou 


types of multiple-choice.items, and although some are predominantly * power, A 


others are *speed' tests. Probably the functional content quite outweig! 
grouping according to form. 


V. FUNCTIONAL CONTENT OF ITEM FACTORS 


The overlapping of items and tests could be approached in a numb : 
Here, each test was first treated as an additional Matrices item, and its corre 
found with the four centroid item factors. These correlations were then wen the 
those of an additional test in the group factor analysis. The results appear ® 
foot of Cols. 9-15 in Table II. 6 

8 


As might be expected, the first item factor is largely composed of g. Its loading E it 
corresponds nicely with the loading of -84 for Matrices total score in Table IV. hnical 
embodies a considerable amount of k also, and shows loadings with v and the tec tified 
T n are probably insignificant. Only 12 per cent. of this factor cannot be iden 

unctionally. 


К on, 
The bipolar factors are more complex. Their group-factor loadings are mostly 1 15 


significant, though they range up to -28. Had the factors been appropriately rotated, Hed 
conceivable that one might have been fairly highly k-saturated, another might have стосе 
all the у or n content, and so on. But the total functional content is so small that it see our 
hazardous to attempt this. The total content (in so far as it can be identified БУ ont. 
reference battery) is -260 x -88 + -063 x -10 +--041 x 13 + -033 x -11 = 24-4 per “oly 
The difference between this and the communality of the item factors, 39:7, presu ctor 
represents formal factors. That is to say, the combined variance of the spurious fa in 
introduced by the imposition of a time limit, and of common factors in all items 9 per 
groups of items which have no identifiable functional meaning, amounts to some 15 A 
cent. (It may be rather larger if, as suggested on p. 5, the 39-7 is an under-estimate- «t 
further attempt to discover, and assess the size of, the component elements of this 15 per ° 

is described in Section VII. 
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VI. FUNCTIONAL CONTENT OF SEPARATE ITEMS 


The tests were divided into five blocks corresponding to the five group factors 
(omitting G-5 and Matrices total), and the g-content of each item found from the 
block with which it gave the lowest correlations. Its group-factor content was then 
determined from its residuals. Table II, Cols. 9-15, lists only those group-factor 
loadings which are at least as large as their presumed S.E.s. 


As would be expected, the g-loadings parallel closely the Item-factor I loadings, but are 
mostly rather smaller, since only 74 per cent. (i.e., ‘863 of Factor I is g. Loadings on 
gange from "41 to zero, hence we can almost certainly conclude that some items do demand 
m Spatial visualization than others. Sets C and E tend to be more spatial than B and D ; 
im this could have been deduced from the k-loadings of the item bipolars (thus Factor II, 
Which Sets C and E have positive loadings, correlates -22 with А). It does not seem 
Possible to tell, from inspection, which kinds of items are likely to be most spatial. — — 
con Several items, particularly in Set E, obtain v-loadings. Though unexpected, it is 
aye cule that such items demand a species of logical analysis in which the verbally minded 
loader advantage. Seven out of the 11 most difficult items in the test are among these 
$ а ed ones; this corresponds to an ғ; of ·74 between difficulty and v. The n-factor 
EA Small loadings for five items only. This may be chance, but inspection suggests 
e ү ey may involve counting or other mathematical thinking to a greater extent than 
not est of the items. The perceptual loadings are interesting since they are mostly (though 
y, invariably) confined to very easy items—thus confirming Thurstone and Shaefer's 
pothesis. Four of the eight are among the eight easiest items, yielding an r; of -63. 
NS meu this association is statistically unreliable, but it is suggestive. Conceivably if 
doctos been able to analyse more of the easy A and B items, we should have found this 
r playing a more prominent part. і , : 
"This € most puzzling result is the number of items showing technical-factor loadings. 
mere]y ught not to Occur, since the test as a whole is free from the factor. Possibly it arises 
avoids because mechanical comprehension tests are k-saturated, and it would have been 
yn ed had the £ group factor been extracted before instead of after the technical factor. 
techni, Stretch of the imagination can items B 8 and 6, C 5, D 10 and 6 be seen to involve 
а S knowledge any more than other adjacent items. Most of the remaining loadings 
Very small, as also are the technical loadings of the item bipolar factors. — 
to Den the variance of g and the group factors in all the items is totalled, it amounts 
factors) Pot Cent. That it should exceed 24-4 per cent. (the functional content of the item 
of in is quite correct, since the specificity of an item might well consist, in part, pigor 
in Col € familiar group factor. This would explain, also, why some of the communalities 
. 15 exceed those in Col. 8, though probably the main reason is, simply, unreliability. 
Perhaps the chief finding of this section is that the functional content of Matrices 
Progr as distinct from their formal factor loadings) does not alter markedly as one 
See There will be a slight tendency for very low scores to be based on 
tenden ual factor to a greater extent than average Scores, and a rather more marke 
ау, for very high scores to be v-loaded. Also, of course, the Duke 
all ley шоп tends to rise from beginning to end. But generally speaking ше 
From ts Covered by this investigation represent much the same mixture of g an t 
Of conten, Point of view the Matrices test probably shows rather better homogeneity 
is likel ent than does the Terman-Merrill scale. On the other hand онт a 
iffere A to be much more free from formal factors, for these undoubtedly do have 
nt effects on different levels of Matrices scores. 


item 


VII. FINAL ANALYSIS OF FORMAL FACTORS 


Since į Il 3 further, a 

fresh Ce it seemed impossible to resolve the original item factors any я 

origina emp Was Дале first extracting the g and group factor Бос cd 
item inter-correlations. The residuals, which were presumably n 
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by formal factors only, were then subjected to centroid analysis and three a 
extracted. Their variance was 8-9, 4-2, and 4-7, total 17:8 per cent. This is a little 
higher than the 15 per cent. previously allowed, but naturally so because the analysis 
has gone much further. Instead of four item factors, we have removed g, five group 
factors, and three more bipolars. 


These factors are not shown here, since all attempts to rotate them into logical structure 
proved fruitless." They did however suggest that the structure consists of : 
(i) the bipolar difficulty factor ; oe 
(ii) a general factor in almost all items, representing the test’s specificity ; wit 
(iii) separate group factors in the B, C,-and D sets, and possibly two group facto 
E 1-6 and E 7-12. 1 
Here we have seven factors. It is hardly to be wondered at that the original four item 


factors, in which not only these but also the six content factors were amalgamated, were 

difficult to interpret. А and 
That No. (ii) exists, i.e., that there is some positive overlap among all items ovos tem 

above their common g, k, etc., content, was proved by studying the residuals of each 1 


2 Т С ) ; as 
with Matrices total (after removing g and k). The mean variance of these residuals Wê 
4-5 per cent. 


Since neither centroid nor group-factor techniques can readily cope with a general 
+ bipolar +- group-factor structure, it was eventually decided to use the тош 
Item factors П --1ПЇ (cf. p. 6) as an approximation to the bipolar difficulty ir io 
and to remove this first, The iesiduals were then analysed by Burt's group: ac = 
technique. E 7-12 no longer showed any common variance, and £ 1-6 overlap? 
closely with Set D, hence three, instead of five, group factors were sufficient. 
A 11 fitted in with the B set, but A 12 more closely resembled the DE set. r 

The results appear in Cols. 17-21 of Table II. A few of the loadings are арпо! 
for example, E 5's general factor saturation of -81 and D 5's group factor o! ` 
at the tail end of such complex manipulations of rather unreliable inter-item СОГ 


n loselY 

EA be ренеп, The general factor loadings for the 36 items agree йш summed 
vlatrices total score residuals (г, = :76). W i e important, t о 

communalities for the six functi d В pet 18 mor R th thos? 


à onal and five formal factors agree quite well W ; 
the four item factors (r, — -82), though they tend to be over-large. he group 
The variance of the difficulty bipolar is 5-9, of the general factor 7-5. and t ted, the 
factors 9-2, total 22-6. If the two abnormal loadings, just mentioned, are omit o large: 
general and group factor loadings drop to 5-9 and 7-4, the total to 19:2. This is still t9 should 
We know that the general factor variance should be near to 4-5, and that the to Гош, 
not much exceed 15. But considering that we have isolated 11 factors in P ps neral T 
the exaggeration is not unduly great. Our main conclusions are that the bipolar 7 Ü 
ER Miei give a fair approximation to the structure of the formal factors: " ry be 
ese three components are roughly equal in variance in the average item. 


Hari | 
ш formal factors linking smaller sets, or pairs, of items, but they are probably of ^ 


Broadly speaking then an average Matrices item is determined by: 8 itive 
cent., k or technical 5 per cent., v or 14 per cent., perceptual 1} per cent, P'but to 
or negative difficulty factor 5 per cent., Matrices factor (common to all items ont, 
no other tests) 5 per cent., Set group factor 5 per cent. The total is 42 pet ^ oup 
еш Specificity 58 per cent. Had the test been given to a more represent 
P 005 the last figure would drop to 32 per cent. and, so far as we can РЕ 
s E increase would have occurred in the g-variance. Butas we do not 

ctive variables are operating in our population, this is uncertain. 


Т0 what 


1 ep ; re о 
and Ga маз positive for all items except E 7-12 and С 12. It was presumably а miche grou? 
factor (iii) D. ‘This oe pent ny high loadings to the D set of items, i.e., it represen 


plains why it could not readily be resolved by rotation. 
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VIII. CONCLUSIONS REGARDING FACTOR 
ANALYSIS 


Be iub exploratory investigation has been crude and not very accurate, and it has 
(c ERE p by inadequate tests of significance. (For example it is quite impossible 
But this w ie Matrices items differ significantly in perceptual factor loadings.) 
аге anal le seem to be inevitable when tetrachoric coefficients of poor reliability 
rox, - ‚ and when the number of variables is far too large for successive 
techniques ion to communalities, or for the application of more efficient iterative 
Variables c m extracting the centroid factors, doubts frequently arose as to which 
Satisfied the reflect, and there was certainly more than one possible solution which 
rigley (30) outenon of positive column totals (disregarding the communalities). 
should aim has recently discussed this problem and concluded that the factorist 
loadings in _to maximize the variance of each bipolar by reflecting tests with small 
even if thi such a direction as to increase the column totals of tests with large loadings, 
M 5 does not produce all-positive totals. 
асое оГ these drawbacks the study has thrown much interesting light on test con- 
relative varia ems in general, and has permitted for the first time an assessment of the 
e information: of functional and formal factors in a. widely used group test. Whether 
Structing a n on obtained is of sufficient value to justify the work involved when con- 
е 50 best ew test is more dubious. There one often tries out a hundred items to pick 
themselves t ТАк factorization would be out of'the question. The analysis of items by 
em-analysis es Т5 very little which could not have be£n learnt more simply from stock 
Structure like RU EIS It appears to be capable of revealiifg only major defects of 
Unctiona] cont he difficulty factor in Progressive Matrices and the prominent variations of 
The incl ent in the Terman-Merrill scale. . ч : 

though ane usion of a battery of reference tests in the analysis constitutes a powerful tool, 
Broup аа аР that is very time-consuming. At least three tests must be given for each 
before we is that may be involved. Moreover, very large numbers of testees are essential 
f the Prese an be certain, say, that an item designed to measure g is overloaded with k. 
chary ofe nt writer had been commissioned to revise the Matrices test, he would be extremely 
Probably ee or altering any of the items on the basis of the results given in Table II. 
of functional Would regard as the weakest items those that showed the largest proportions 
But this Barden actor and formal-factor variance, in other words the smallest g-variance. 
б Short batt riance could have been obtained very easily by ordinary item analysis against 
actor апа ery of tests providing a reliable external criterion of g. So that here again 
alysis seems superfluous. 


One w; 5 as ү 
Mited Mr making such research more manageable would be to combine items into 
er neous as possible in form, difficulty, 


ind COntent of groups, each group being as homoge r c 
t each of p For example, the four easiest, four medium, and four most difficult items 
0 aven’s sets might be treated as single variables. It would still be necessary 


IX. SUMMARY n 


LET aor 
fa Ве application of factorial analysis to a set of test items (or to the sub-tests 
nd eth?) Will help to show, first, whether the content of the test is homogeneous, 

n T different items or sub-tests measure largely distinct abilities. It is 


7 88ested th, 
(ош that the first or general factor running through all the items or sub-tests 
i ignificant bipolar factors 


obtai 
(before tain at least four times the variance of all the si і 
he test is to be regarded 


An Application of Factorial Analysis 


2. Secondly, it may indicate whether unwanted ‘formal’ factors are present 
in some or all of the items. These include factors corresponding to the form 1n 
which the item is set, speed factors, or a factor contrasting difficult with easy items, 
etc. Such factors probably play an unduly large part in most multiple-choice b 
They are contrasted with "functional? factors, which have useful psychology 
meaning, such as g and the recognized group factors, v, 7, k,etc. Itis possible tha 
some of the familiar factors such as W and F (fluency) and P (perceptual speed) are 
primarily formal. А is 

3. Such analyses are extremely time-consuming when the number of items К 
at all large, and the conclusion is reached that they contribute little which could me 
have been learnt from stock item-analysis techniques, unless it be certain gross 
defects of the test’s construction. Items must be inter-correlated by tetrachorr 
technique, and the resulting coefficients and factors tend to be very unreliable. nd 
is suggested that analyses of groups of items, homogeneous in form, difficulty, Sent 
content, would be more economical. Several overlapping analyses at differ 
difficulty levels may be essential for a test covering a wide range of ability. 

4. Functional and formal factors cannot be distinguished unless the. 
analysed along with an extensive battery of reference tests which will identi 
g and group-factor content. This too is seldom practicable, and is hard 


if ordinary item analysis is carried out against an external criterion. lem 

5. Nevertheless heterogeneity and formal factors constitute a serious prob sae 
as shown by the non-linear regressions frequently noted in following UP. Army 
Navy tests during the war. An exploratory study was therefore made of on 
used test, the Progressive Matrices (1938), given with 20-minute time limit. 

6. Forty-two of the more difficult items of this test and 21 other referenc neral 
were analysed among 640 R.A.F. recruits—a group somewhat superior to ie EET of 
18-year male population. The tests homogeneity, as shown by the varianes jna 
general and bipolar item factors, is low in this group, but would be just ade easy 
representative population. Items of medium difficulty tend to be superior 
and to very difficult items. à 1 

« difficult 


7. Owing to the imposition of a time limit there is à prominent ear 
factor contrasting the earlier and later items. Other formal factors apP in an 
include a general factor in all items (corresponding to the test's specificity is of 
ordinary analysis of tests), and separate group factors in most of the main tor in 
which the test is composed. The variance of each of these three kinds of fac 
the average item is estimated at close to 5 per cent. t and 

8. The functional content of the items is analysed into g 18:4 per epe in? 
group factors 8:6 per cent., but the figure for g might be more than 
representative group. Different items depend to different extents on Y nko ppe? 
and perceptual speed group factors ; but k is much the most important. y 


Im at Si os 
This research was completed before I knew that Miss Keir had carried out а some ces | 


investigation among Secondary Modern Schoolchildri ir. * The Progressive зеде 
Applied to School Children,’ this Journal, 1949, II, ме (9 де of the two studies M 5 Y 
confirm one another. Thus the order of difficulty of items among adult recruits A 
correlation of -94 with that among Keir's boys. Nevertheless some of the misplacemen | D ies 
to which she draws attention are reduced among adults ; for example the mean difficulty 0 er 
becomes greater than that of C items. Although the general factor-saturations of parse 
differ considerably at the child and adult levels, my second and fourth bipolars sub-divi ral fact he 
into the same groups as Keir's second and third. In both researches the reliability; gently 9 ty 
saturations and correlations with other tests are reduced by the restricted range © ай nigh 
testees. But my correlations with eleven verbal spelling and arithmetic tests are somew ven 
of the en j^ p as compared with her average of 44. And the intern the odd © 

Я own е -Ri HA wil 
coefficient of -76 eae Ua Enos at M ы рые 
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to enter chiefly into the most difficult, and perceptual factor into the easiest, items. 


Early items are also less g-saturated than later ones. 


But on the whole the test does 


measure much the same combination of functions at all levels (within the range covered 
by this group of subjects). 


9. 1f, in a test given with a time limit, the formal or functional content does 


vary significantly, it is essential that the items be thoroughly mixed (e.g., in cycle- 
omnibus pattern) and presented in approximate order of difficulty, otherwise different 
levels of scores will measure different combinations of abilities. 


10. н 


- Guilford, J. P. (1940). * Human Abilities.’ Psychol. Rev., XLVI, 367—394. 


v ол DA ч oo ы 


* ri B К 
рае С. Е. (1950). A Psychological Study of Methods of Predicting Succe 
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A METHOD FOR COMPUTING PRINCIPAL AXES 


By L. F. RICHARDSON 


I. General Explanation. ЇЇ. An Example Begun. 


I. GENERAL EXPLANATION 


The following paper describes a working method for calculating the principal 
components of a correlation or covariance matrix, For reasons which will presen 2 
be obvious the procedure adopted has been called * purification." A full acca 
of the principles involved is published in the Royal Society’s Transactions (Richar 1 О 
1950). The present note is intended as a brief guide to the special сазе for whic M 
given n X n matrix K is symmetrical. Let the latent roots ^ of K be arrange R mns 
APO... > №; and let them correspond respectively to latent colu 
Pi, Pz, - . . , P, by way of the definition 

KP; = y Pj (О.у + / шаг 

The method of purification is a generalization of a routine which 1s ambe 
to psychologists, and which they usually trace back to Hotelling's paper © 
that is to say, the computer begins with a guess Хо at a latent column, and he p 
by iteration. In Hotelling’s method Xo is multiplied repeatedly by К; Br 
P, is obtained. My generalization consists in multiplication by K — el. where © pal 
guess at an unwanted latent root and 7 is the n x n matrix having units in its Prom to 
diagonal, and zeros elsewhere. The presence of p gives the computer To. then 
mancuvre. For example, if he desires to remove P;, and if he can put = ts gives 
he can compute (K — 2;/)X» , and this does not involve Pj. Chemical langua£ esired 
suitable hints. Thus Хә can be regarded as a raw material containing ® | neous 
constituent, which might on occasion be P,, together with a lot of miscell B 
dirt consisting OR EI АРД, . +, Pia. The multiplier (к —ÓIXK —^* 
(К — 7, 11) would remove all that dirt. (xz e 

b. To test whether a column Z is a pure latent column, let each element 017^ 7. | 
divided by the corresponding element of Z, and denote the n quotients by F jatent 
If the n elements of KZ + Z are equal to one another, then they are equal 10 гасі, 
root, and Z is its latent column. This * division-test? is already stan ard P avers” 
The only novelty is the notation KZ  Z. There is no need to consider the c? 
ence of infinite sequences. а О 

The iterations are shortened if the raw material Y» is rich in the desired oor dich if 
P;. Itis customary to start from{1,1,1, 1 1}. But this, thous? aterial 
some constituents, is poor in others. ‘It is preferable to choose as the 14 estio" 
the appropriate latent column of some matrix which resembles the matrix 18 а ВШ 

Rayleigh’s mean is a notable detergent. If а column X consists mainly 1З 
ош , with only slight contamination by the n — 1 other P’s, then ud 221 
mean X K X / XX is oft imati sym 
used у Menos Коно weh. ous xor Hee bir s all Шш 

precept of the purification method is that the wise computer locate T 
pue Toots roughly, before proceeding to find any latent m accurately qq of 
with MM with the ‘initial groping,” which is a game of skill, and 
h e ‘fine purification,’ which is a routine. For any one latent ГОО» 
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of groping ends, and fine purification can begin, as soon as all the other latent roots 
àve been roughly located. 


. Some well-known facts may here be usefully recalled. There is a quadratic form in the norma- 
lized test Scores which represents an n-dimensional surface over which the frequency is constant. 
The matrix of that quadratic form is the reciprocal of the matrix of correlations (as was shown by 
> Pearson, 1896). It follows that the semi-axes of the hyperellipsoid of constant frequency are 
Ум, VAs, VÀs...., Va. We may accordingly expect that the least latent root A» cannot 
js ПеВаЦуе ; for, if it were, then the frequency would be finite in some directions at infinity. This 
э owever, only an expectation, not a definite proof, for the following assumptions have inter- 
Bened : (i) that the persons tested were infinitely numerous ; (ii) that their distribution was Gaussian- 
rmal ; (iii) that the correlations were not rounded off. oe È 
h iEn interest attaches to 1/(u/A,); because, if this quantity is practically zero, then the 
Бурге рѕоіа goes flat; so that n — 1 suitably chosen co-ordinates suffice to describe the distri- 


Il. AN EXAMPLE BEGUN 


llustration let us take Burt’s (1915) study of the correlations between 
{5 for eight emotional traits. Let us call the matrix of observed correla- 


For i 
aSSessmen 
tions К. 


TABLE I. CORRELATIONS BETWEEN EMOTIONAL TRAITS (BURT) 


1-00 0:83 0-81 0:80 071 0:54 0:53 0:24 

0:83 1:00 0:87 0-62 0-59 0:58 0:44 0:45 

0-81 0:87 1:00 0:63 0:37 0:30 0:12 0:33 

K = 0-80 0:62 0:63 1:00 0:49 0:30 0:28 0:29 
0-71 0-59 0:37 0:49 1:00 0:34 0:55 0:19 | 

054 0:58 0:30 0:30 034 1-00 0:38 0-21 

053 0-44 0-12 028 0:55 0:38 1-00 0-10 

024 0-45 0-33 0-29 0:19 021 0-10 1-00 
Пы matrix, К, has also been studied by Holzinger and Harman (1948, p. 176). My 
© differs from theirs only by the fact that I have put units in the principal diagonal. 
stock A Material is needed for purification. The experienced reader Шау Daven 
the give the latent columns of some arithmetical matrix which loss y resembles 

have 4.0 K. If So, he would do well to take them as his raw material. Here I s 


Which? be content with a model which has only a distant resemblance to K, but 
Matrix ^ SO simple that it can be analysed thoroughly by algebra. It is the n x n 
and to e called 4 for * average,' and defined to have units on the principal diagonal, 
A ‘ave the same number « in each other site. 
‘trial ; Standard method for analysing A is to post-multiply it by the column 7, called T for 
> And defined thus : 
Т = { 1, a, 03 3, wf,..., 9-1 }. ) 
that eut bracket is used when what is really a column is set horizontal. It is then ae 
Poly он Roe the п elements of AT + Tis equal to 1+ a(o о? +0? + ott. 33 ELSE a 
De a Compre, In order to provide latent columns in sufficient variety, о is at firs den tp 
Separate Pex number, and at the end of the theory the real and imaginary parts ата 
Orthogo ı1 çı this method the л latent columns come out SmI IN n 
Дт valset.. There is a single isolated root, A, = 1 + (n — 1), for whic E 
y en The other л —.1 latent roots are each equal to 1 — cs Fore sp a 
© applic, EE latent columns are shown in Table II. They do not involve а, 
able to other 8-rowed matrices of correlations. 
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TABLE П. ALL THE LATENT COLUMNS OF THE AVERAGED MATRIX 4 


mlt 1 il 1, 1, 1, 1, à ] 
n-í( v2, 0, -1, -у2, = 9; E 
T= { 0, it v2, 1, 0, =, vies E ) 
їй = р 0, E 0, 1; 0, m E 
Tí %, 1, 0, —1, 0, 1, 0, 7 ) 
T= V2, —1, o 1, —\/2, 1, 0, E ) 
TU 1 0, fy) —J2 1, 0, —1, v2, H ) 
T= { Т, =й, ib = 1, —1, 1, q 


. ively the 

The above eight columns may be regarded as jagged waves having wa 7 uffix 

following numbers of crests in the run of eight elements : 0, 1, 1, 2, 2, 3, 3, ily in phase + 
to T is the number of crests. Waves having the same number of crests differ only 

and one of them is distinguished by an accent. K may 


There is a preliminary expectation that the latent columns of the given m latent 
resemble those of the matrix 4; and that the resemblance may extend also not on the 
roots, if in A we put « equal to the average of those elements of K which аго latent root 
principal diagonal. Then « = 0-4603. For this special form of A we have on 


equal to 1 + 7« = 4-22, and sever roots each equal to 1 — « = 0:54. 


з is 
The raw material is how ready for testing. Each of the columns T elon simple 
to be pre-multiplied by K. The operation is easy because the elements of the 
First, 
KT, = ( 5:46, 5:38, 4-43, 4-41, 4:24, 3-65, 3:40, 281 } - 


е 
2 а пе pone) 

These numbers are also, by exception, equal to KT, — Ty. They are not equal 9 9 Rayleigh 5 

so T, is not a latent column of К. Yet they are of the same order of magnitude’ 


ay 
EM 22, Tum 
mean TEEI T, T, is, by exception, equal to the simple mean, and comes to 4 
be regarded as a first approximation to 24. Next 
02 
KT, = ( 014, 0-59, 090, 0-05, —0-46, —023, — 0:15, 1:07 ; 
LT ү 1:02 
T,— T,— { 010, 0-59, оо, —0:05, 0:33, 023, ©, emen 


A В TET A zero eleme. - 
These ratios are alarmingly unequal. The infinities arise, however, only m UM multiplier 1 
in the divisor. · In Rayleigh's mean the zero elements act, not as divisors, bu 

and so they do no harm. It is 


T,KT, | TT; = 2:648 = 033 . 


The same method has been applied to the columns T^, Ta , Te’, Ta, Тз, Tse 
which Rayleigh’s mean gives for the eight latent roots of K are 


The estimate? 


A= 422 0:96 0:64 0:55 0:52 0:48 0:33 028 
Erom DU ay. fr Te no CG d 


f 
um O. фе 

The sum of the eight estimates is 7-98. The sum of the true 2 is equal to Шш ot d 
principal diagonal of K, namely, 8 exactly. The agreement is close, bu r non f A 
allowed to engender too much confidence in these preliminary estimates ; aged ma” ose 
division-tests was anywhere near satisfied. The seven equal roots of the aver ва f th 
appear to have spread out a bit now that А has been replaced by К; but the seems ® 
Seven roots remains unaltered at 0:54. Altogether the averaged matrix h the ai 
justified as a useful beginning. The raw columns Т can now be purified wit 
preliminary à which they have provided. 
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To find the greatest root № . The other roots appear to form a cluster, well separated 
from %. This allows us to * kill seven birds with one stone,’ namely, to remove most of the 
Pa, Pa, PX, Ps, Ps, P; , Ps, which occur as impurities in To , by way of the single multiplier 
К —0-541. ` Thus let P3 denote a second approximation to Р, defined by 


P} = KT, — 0-54 T, 
It is important to notice that the upper of the two affixes to P is not the index of a power, 
ut is the serial number of an approximation. 
It follows that 
P: = ( 492, 484, 3:89, 3-87, 3:70, 3-11, 2:86, 227 } 
{ 21-55, 20:97, 17-92, 17-46,16:45, 13-72, 12:59, 9-68 } 
= { 438, 4:33, 461, 451, 444, 4-41, 4-40, 426 } 


The elements of KP? + Рз are much more alike than were those of KT» + T». Considerable 
rification has evidently occurred. Rayleigh's mean is 
PiKPi|PiPi = 4405, 
апа is probably correct to the first or second decimal. If further accuracy is desired as to 
thie 1, it can rapidly be attained by repeated multiplication by K — 0:541. Howe as 
ops; Der is about method rather than results, I leave the routine for refining A, aside as 
vious, and turn instead to :— " 
Approximations t its l t column Pg. The column Ta, 
Paneer s to the least latent root *s and its laten 8: ч 
Which yielded the least estimate of », has been chosen af the raw material. The Ed 
every E ltiplier К — ol, is discussed in the full account of the?method. For re асу, 
inte Pete р must exceed 30. + As). If nothing is known about the positions of the 
Mediate roots, As.. . Ai, then the best choice of p, for a single rou lE 
омос Scenes about 20, Repente d ma crus of ie cuter 
Yield Ps; but the process would be insufferably slow, t С 
А Iti formation that there 
to às. It is better here to take advantage of the special in А 
0 latent root b ; he purifier need not be reliable. Two 
inh etween 1 and 4-2. In this gap the pu P 
pliers can be : ; ge P,, the other to remove Pa, Ps, Ра, 
used in conjunction, one to remove Fı, Жен ns ded 
For da 2». The multiplier (K — 4-437) will remove P, neatly, and is relia 3 
thi plier ( 8 nt p must exceed 
30, 5 Purpose of removing P,, Ps, Ps, Ps» Ps, Pz, the constant р 


Fora single multiplication by K — pZ it would be suitable to take p = 1(2% + As), 


ith o eS i i i e. Actually I began 
With o SS to 0-73 on the uncertain information available at шаат мааа 


AF a р it became с th 
у and afterwards d d o to 0:6 when it [ 
Vents us mule Q GA iot ам sender die аара 
© ampli Mplitude of P, is taken as the standard, then the m ! EO eae 
To герге des of P ,, but greatly magnifies the amp t 
(к Se multiplication, Dy de das iion, The combined ШШЕ (С E 
Som 1) is reliable everywhere, except in the gap where there are no laten un ere 
after ү Cr desires to multiply by a high power of a matrix, obtained by e Eee 
him to уе Aner of Hotelling (Holzinger and Harman, 1948, p. 166), it wou d 
the trees Out a high power of (K —4-431)(K — 071). Неге, however, in ©, ы 
puri y p» I have used multipliers that are linear in K. It is парозе о аа E 
443 is an 1 cannot be removed once for all at the outset. This is impo e vs Е 
Small inaccurate estimate of м. But even if A, were known p > 


і е ігасеѕ 

d г would tides of P, would creep in whenever the digits were curtailed, апа (KO) 

and аре Magnified in the subsequent multip mand diminishes, 

May hi : б these circumstances the 
Mputer RE the steady decrease in the other impurities. In 


ti timate of 
Aa opp tiplicat : Е ood, for it increased the estima ilar 
«йур ined b on by (K —0-7/) did harm as well as g od ion by (K — 0-61) had a simi 


Y Rayleigh’s mean. A subsequent multipl 
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TABLE Ш. SUCCESSIVE APPROXIMATIONS TO THE LEAST LATENT ROOT OF K 


Estimates of Latent Column Ps Estimates of Latent Root As 
ivisi ;/leigh's 
How Obtained Symbol TE OIE test i| Ву Ваа 
From matrix 4 JC Ts —0:06 © | 0-28 
EO г. Pi —3:6 © | 113 
учуб уы .. as RS ESI 0:91 | 0:13 
KP2—07P2.. a. .. P3 —2:52 0:054 | 0-11 
pp 2. Pi —0-174 0:212 | —0-005 
iare Waza с. < [+ „РЁ —1273 1-978 0:031 
KP§ — 443 P3 E sor Pi | —0:296 0:029 —0-012 
KER OG C eR Pi | —00з8 0:310 —0:013 
КР — 443 Pi bo QAUM PË —0:036 0:274 —0017 


A n З Hei zero. 
I have no doubt that 2, is negative, and that it can be excused as practically 2 


The last approximation to P, is 68 } 
Pi = { 92:5669, 49-2177, —78-3811, —26.0161, —28-1690, —26:9881, —29:8714, 420 


F by 
To find the second latent root it would be suitable to multiply Ti repeatedly 
(K — 4-43Г(К — 0:31). 


But perhaps enough has been done to illustrate the method of purification. 
I am obliged to Sir Cyril Burt for various guidance. 
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ч ۳ ч fact 
Nore.—At Dr. Richardson's request I append the figures I myself obtained for the 


saturations (latent vectors before normalization) and factor variances (latent roots) TO i hano 
largest factors. These results may be of assistance to any reader who desires to try us ptained | 
at Dr. Richardson's instructive method of calculation. The figures given below were nS te 
suit the requirements of Karl Pearson's “ method of principal axes” ; they Were CO 


weighted summation with unity in the leading diagonal (as in Table I above). ий 
Trait 1 2 35234 4 5 6 7 8 4:439 
Factorl .. .. -961 эз 80 780 731 608 5557 407 1413 
Factor П .. .. 047 —145  —452 — 186 `369 244 :677 — 464 
c. Burt 
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ATISTICAL STUDY OF THE RORSCHACH TEST 


By AMYA SEN 
Department of Psychology, University College, London 


I. Probi 
em. I, 
(a) Cognitive A ts and Tests. ПІ. The Validity of the Rorschach Results : 
; (b) Orectic Traits. IV. A Factor Analysis of Rorschach 


ategories, 
S: V. Summary and Conclusions. 


I. PROBLEM 


he Lhe Valid: 

i alidity 

n [е foll owing A the Rorschach Tests. The object of the investigation described 
ained aper was twofold : first, to compat the validity of the results 


on with th : z 
[3 € Rorschach inkblots (a) when the test 5 scored according to 


~~ Of the 5 
w Teco, ч 
obj. the Bonos di Rorschach methods, and (b) when it is scored in accordance 
ectivel ggested by Professor Burt (10); and secondly to determine 
opriate classification of the 


a ys mea J x 
Wit us charaeteristios Ls factorial analysis, the most appr’ 

д the с assificatio revealed by the test, and to compare this objective classification 
i ns of personality types which the test is said to reveal when 


interpr 
eted subjecti 
bs The inti Jectively by an orthodox Rorschach technique. 
ot test a Я 
е ? 0t test as standardized by Rorschach and his followers has b 


Stron, most Widel r 
for es) criticized bo et tool in diagnostic personality testing " (7, p. 85). 
y those whose work lies specifically in the 


een described as 
Yet it has been 


hag re cauti 
We r 
х Orkep эх oxen established sabe 

2, p.2 ‚ and whose subjective na 


Scor 213). And G 64 
c ES see, oodenough, after pointing out 
Where des, a little s, агвеју to have been “ developed on the basis of chance re 

v Ore cautiously, that “ the Rorschach method has now reac 


Ore crit; c 
"tical attitude is desirable ” (25, p. 436). 


ture does not attract the scientific 
that the * formule ' for interpret- 
lationships," 
hed a point 


inure + Place, despite ; ress on two main defects. 
th Stig Its re АБ te its widespread use, hardly any researches have been undertaken to 
Wie neuro ions 8 ри and validity by recognize: There are plenty of 
lelq otic and th ne that it * discriminates ’ between various groups—particularly between 
A een Westionaby, Non-neurotic, and between different types of neurotics. But a test may 
an P e Broup.a evidence for ‘ discrimination’ (i.e.. significant difference 
d syn Port obser Averages), and yet possess little or no và 1 diagnc 
`D. вр tomatoja eS» * the published comparisons of blind diagnoses with clinical histories 

Cf. а ву do not validate the test: they merely demonstra 


Goodenough, 25, p. 436). 


The Rorschach Test 


One of the objects of the following paper will therefore be to provide quantitative inde 
tions as to the amount of validity that attaches both to the separate categories and to the te 
as a whole." Р +. М of 

Theoretical Construction. In the second place, as Rapaport points out, ‘ the tie-up / 
the test with theoretical development in general psychology and the psychology of personae 
has also lagged.” The traits or characteristics which the Rorschach test claims to assess coe 
in the main of an improvised list of psychological qualities which Rorschach himself Hi E 
nearly thirty years ago ; and these for the most part assume the popular but highly fatia lacs 
principles so often stigmatized in criticisms of the faculty doctrine as the mane UM es 
and the ‘formal fallacy.’ In consequence, as even the Rorschach expert himsel А Eh 
admits, there is considerable danger that “ the tester, especially the novice, , may be le lain. 
** dream-book type of interpretation " (20, p. 88). Goodenough makes a similar comp 
“There is no obvious resemblance between the nature of the subject's responses ай АУС 
personality traits from which they are alleged to spring. Many of the * signs ° reporte 
stemmed for the most part from * hunches ' " (25, p. 435). As A sed test 

Nevertheless, * the scientific worker ’ is scarcely justified in dismissing à widely u s ments 
outright, without at least attempting to discover what degrees of validity its asse i 
actually possess, or suggesting constructive proposals for its modification and improve main 
With the view, therefore, to making some systematic contribution towards these у ES 
problems, a number of researches have been planned and carried out in the lao a 
University College. Of these, the earliest, so far as the Rorschach test itself is соп А 
was that undertaken by Miss Kerr, who applied the test to 265 normal, maladjus , 
defective children (11). A number of recent theses, published and unpublished, Wo 
children or more frequently on adults, have dealt either with the Rorschach test as such, 
the Rorschach test as one of several methods of assessing personality (cf. 17). 


Il. SUBJECTS AND TESTS 


Subjects. One common line of criticism has been to suggest that He 
correspondences between test scores and personal characteristics might Ба 
rather than of psychological origin. It seemed therefore desirable to 9 est hose 
from persons brought up under somewhat different cultural conditions r 
who have generally been the subjects in most of the researches hitherto 


res. 
; UD Я chach 5 
1 The few investigations that have attempted to assess validity (e.g., by correlating ROU cessment pn 
with an independent criterion) have been concerned almost exclusively W! h thes 


t 
intelligence. The chief exception is that of Dr. P. E. Vernon (12, 13). In Table Me and 
(27) I have compiled a comparative summary of the figures obtained by differen 
have shown that they are extremely conflicting. ‘ect of 

The reliability of the scoring and of the test-results was not a primary objec 
research, though a small sample of the scripts was scored independently by Ramzy, 
are given later in this paper. Miss Hillman has demonstrated that the reliability 0: h tetra ors 
blots, judged by two sets of scores based on half the blots, yields only а moderately E mines 


ды t exal 1515 
In regard to the reliability Mey hologi*y 


EXC с these ^^ nd 
2 In several earlier investigations an inkblot test ha i don schools. In urt а 
however, a composite battery of tests was аруа на isla to that used by big 
Moore, in which the test-material included not only inkblots and s ot patterns, but аб 
or indeterminate pictures, unfinished drawings and stories, as well "as tes! 
these researches the most suggestive was that of H. L. Hargreaves on‘ T е Faculty 9. Йй 
(Brit. J. Psych. Mon. Sup., X, 1927) who used tests of this nature with 131 children in iro 
Inkblots ’ and * indeterminate pictures * both proved to be valuable t. гог two aspec 
tion termed ‘ fluency ° and ‘ originality." E tests p 


p my 
*I should like to acknowledge my indebtedness to all those who have assis din d 


4 te 
investigation, and particularly to Dr. Charlotte Ba : | help 
Statistical analysis and in the writing of my paper. nks, who has given spe 
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According} 3 dy, 100 Indian students were selected ; these com- 
prised с са, ы for the most part between 20 and 25, ошо 
Various subjects either in London or in Cambridge. кү 
Tests. Each subject was given an individual Rorschach бе el 
tests (Group Test No. 33, a verbal group test prepared by Burt P a non-verbal 
Nstitute of Industrial Psychology, and the Progressive Matrices tes sed largely on 
BrOup test prepared by Raven), and Cattell’s test! of fluency (ba NesGo nares 
-ArBleaves' work, cited above). At a later stage they also took two q 
Intended to elicit relevant traits of personality (for details see 27). 


d that 
Ath Scoring of the Rorschach Test. (a) Rorschach, Method. Rapaport has ер hat 
« We lines along which the examiner should organize the scores ” тау 


i * di ions " should be 
at ensions " of the tests, and that, in marking responses, these * dimensions " sho! 
SONA though they were a system of co-ordinates within wh 


Within o 


In everyqa » ich а 
as ау life." IE f i with the 
dark si^ he will be observant in ordinary life; if his responses ae largely on th the dark 
shadows OWS in the blots, then, it is supposed, he will tend to be undu у Pah 
as of litte is own existence ; and зо on. The actual content or ma егі 
become су, Sequence : “ only in very sick patients, or cases NS subject's A 
Constitutes SSY revealing ” (ibid., p. 102). Itis the ‘form’ of 110 S Phe reacts in seeking to give 
Some intense, main basis of the scoring. The general ways in wate ded as a sample of the 
forma 1 uclligible interpretation to the visual stimuli before him ial life. 1 

characteristics that distinguish his mental reactions In zemi interpretation has always been 
Widely erman writers on individual psychology this mode O i personal characteristics 
from tHe eValent : it is admirably exemplified in their method E “among English-speaking 
Writers ûC OFresponding characteristics seen in the person's handwriting., forms of mental activity, 
Which won. Pe other hand, from Thorndike onwards, the notion ola: t. Itis true that recent 
factorin oE alike in every kind of concrete situation,” has been highly ST factors common to certain 
tYpes à Studies have at times confirmed the existence of a few Jo mmal to; the variance has proved 
O be smaji tal process : but, with rare exceptions, their contributi 


"in 190 small for diagnostic purposes. by Rorschach 
None the detailed Delta 6t the Vendus ‘categories’ commonly adopte d Cof 
Standard and of the scoring procedure generally employed, the re 


| on th i o no! at the scores 
üre usual] TKS the subject (9, 18, 19). Here it will be sofia $ x te ш ОКЫ 
A Sn (à) pssified under four main heads : (i) Quantity, 1.., nu Ы s Ё um г va ats. 
r 1 or only p 


A 1 ers the e 
Possibly 170 Or Location, e.g., how far the response cov 


Stitue Ust minor detai essfully the response @ colour, 
Nts пог details, and how succes кше 20 e. references tO 
(b) foro Into a logically articulated whole ; (iii) Dee Ais Saee balance lo СЕС. 


icularly ( Jat 
ng, (d) movement, and particular : 
üs Yw pe whether the things seen in the tests tend to be orig 
Mor pn 1 
Object,” Or less symbolic, human beings, other animals, blooc erue “t will 
€ like, and whether they show marked тереч нү 
», Sven in assessing * contents,’ it is stil the Bee ee 
ol зресі ir direct implica . 3 ave a 
Principles” following enquity, n pos Ct scores эрсе АРЫТ modification 
“с; » Beck’s procedure (18) was used, with two minor 
Attell, | 
2 Thera В, (1948). A Guide to Mental Testing, PP- 177f. ; ¢ and their Causes. ш, 
os E Е * Individual Differences d 
Pp. 365p, ^ L. (1923). J. Educ. Psychol. ‘1091 
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of his * Z-score* was introduced along lines suggested by the present writer, and Klopfer's 
“form level score ’ (19) was used in addition, with a view to determining which procedure was 
most effective. As a check on the reliability of the scoring, Mr. Ramzy was good enough 
to score 25 of the scripts independently : the contingency coefficients for * location ' an 
the various * determinants ’ averaged 0-86 and 0:90 respectively. 


. . (b) Burts Method. The principles underlying Burt's suggestions may be summarized 
in his own words as follows. 


"Inkblots have long been used as test-material, both in experimental and in individual 
psychology. Before James' criticisms ? put the term out of fashion, they were regularly. employed 
to exemplify what neo-Herbartian writers called ‘apperception.’ * Apperception’ may be define 
as the mental process whereby the contents of a particular sensory presentation, at first relatively 
indeterminate and meaningless, become clearer and more determinate, and so acquire for the 
perceiver some kind of meaning, in virtue of a fusion or synthesis with contents already more or ie 
systematically organized in his mind. It is the fashion nowadays to describe the effects of tl E 
process by a popular metaphor from the magic-lantern : the perceiver is said to ‘ project’ * his idea 
and feelings in his mind on to the stimulus he is perceiving—an analogy which easily leads to extreme! у 
naíve, crude, and unjustifiable deductions. After all, thanks to the experimental work of Sharp, 


727, p. 75. It seems more in accordance with the principles of scoring adopted by Beck and other 
Rorschach testers to regard the following types of response as not falling within the scope of Beck $ 
2: responses in which a small detail has suggested a comment bearing no genuine relation to the 
whole blot, responses consisting of combinations of small and large details where these are plainly 
inaccurate, and finally, vague, indefinite responses, such as ‘maps’ ог the like. All these WOU 
receive a Z-value according to the official instructions for Beck scoring. In my own investigation 
both the ‘unmodified and thé-modified Z-score were calculated and compared ; but, as will be seen 
from the final results, the latter seemed unquestionably preferable. 


* James, W. Principles of Psychology, V, pp. 107f. James quotes a long passage from Steinthal, 
who has perhaps given the most thorough analysis of the ‘ apperceptive processes’ and their 
possibilities. Steinthal points out that, in ordinary everyday life, our commonest clues to a тап 5 
character and their place in life are furnished by his “ mode of apperceiving." James comments 
that he himself finds Steinthal's analysis * simply burdensome. But an analytic study of He 
different processes involved is surely preferable to lumping them all together as comparable with 
the simple and direct * projection ° of a slide on to a screen. 


2 Stout, G. F. (1899). Analytic Psychology, IL, ch. viii; (1913), Manual of Psychology, PP. ТАЕ 
As examples he cites inkblots, puzzle pictures, and clouds or constellations in the sky. Seashore 
definition is similar to Stout's: “ apperception designates the fusion of the presentative with 5 
representative processes, including all their affective and conative processes, in the moment ofa T e 
experience. It is the grasping of a new experience so as to give it meaning and clearness. m 
interpretation is always personal ” (loc. cit. inf., p. 144). It may perhaps be added that, since D 
above was written, the term * apperception ' seems to be coming once more into favour : thus 6 
commonly translates Rorschach's terms Erfassung and Wahrnehmung by 'apperception ’, en 
23, pp. 88 and 190), and describes the * determinants’ as indicating ‘ quality of apperception. 


* Elsewhere Burt points out the widely different kinds of mental process that the ambiguous Шү 
"projection" is made to cover. Bell, Rapaport, and Goodenough, for instance, all treat 177 
Rorschach test as a typical “ projective procedure " ; but they explain the term in very different wa 
Goodenough gives three explanatory examples : (i) * We say our soup is cold " : here we a ur 
the “ sensation in our mouths” to the soup outside us ; (ii) “the office-manager, during an at ane 
of indigestion, raises a tempest over a misfiled letter; he projects his feelings on to the ре ег 
and things surrounding himself." But he does not actually attribute his angry feelings to the Som Я 
or the clerk. (iii) “ In some forms of mental disease projection is carried to the point of delnai à 
the patient sees demons or hears voices of angels." Here hallucinatory reality is attribute s the 
visual or auditory image—again a different process. Rapaport, on the other hand, attributes s 
projection not to the person but to the test: “ the concept of projection in projective DE Sr 
is formed on the pattern of projector and screen.” No wonder Bell observes that “ no eng ud 
common definition of what is meant by * projective’ has as yet appeared among those A in à 
these methods." He is quite wrong, however, in stating that “ projection was first USC with 
psychological sense by Freud." The idea of such a process has always been a favourite one : 
physiological and medical writers; and was criticized long ago by James (Principles, IL, Vitia cy 
As Ward has put it, “ those who talk about * projection have usually committed the prior tata 


ээ” 


of * introjection ”. 
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Pillsbury, Bar 
y, Bartlett, a 1 c 
Apperceiving eho none we do know a good deal about the varied processes involved in 
m Megbooks/on буре writers on the Rorschach tests make little or no mention of it. 
E type to domes р паві psychology commonly started with а laboratory experiment of 
Contribution of his we o the student how much of what he thinks he sees is really an unconscious 
206 Observer to Biter Чий, апа how characteristically the interpretations offered differ from 
voodchuck ; ‘eee fo a hunter,” it is said, a particular inkblot ‘suggests a beaver or a 
веһов or flounder о mE a trowel ; to a keeper of pets an Angora cat; to the naturalist a 
The TE ONOWS that E ing as his mind has been most directed to land animals or to fish.’ * 
an ame Series of seni Comparing ше particular ways in which different individuals apperceive 
ep o Dulary ett ocal stimuli, we may often secure a detailed insight into their habitual 
Deriences, and even Th background, their dominant interests and ideas, their past history and 
that Onlents of the * Aes emotional attitudes and complexes. In all such cases, however, it is 
Er 5 Provide us with ү! ceptive contributions,’ as disclosed by the contents of their interpretations, 
wily SOciative processes requisite data, rather than the formal characteristics of their perceptual 
labo, natural to classify i In summarizing the inferences drawn according to this principle it 
schep ted by the work he traits in accordance with the usual type of psychographic scheme, 
© and some speci d in the field of individual psychology, instead of inventing some new 
ne In the present la ist of categories, expressly for the purpose of this particular test.” 
Secus trials had кашу the assessments were limited to those particular traits which 
tion Dses were асс bn could be most reliably rated. Accordingly, data extracted from 
Un Perception of Rel under the following heads : Intelligence, Verbal Ability, Imagina- 
his ion, Cheerful e'ations, General Emotionality, Extraversion-Introversion, Assertion- 
aes ional and Серге Sociability, Anxiety, Neurotic Tendencies, and Extent 
ished in the аа Interests. These largely correspond with mental factors already 
Suggest Y countin 5 of cognitive or orectic psychology. The. numerical ratings were 
Tunnis d by Burt « fay the number of relevant items in accordance with the scheme 
8 from Е— as ^ Score for each trait was then converted int terms of a 15-point scale, 
e rated independently by two judges 


ere friends or close acquaintances of 
me roof with them for at 
ir of judges who were well 
groups of 15, 13, and 10 
Wi Пе subj RAE Each was asked to 

“Te made rre in his (or her) group for one trait at a time ; and the assessments 
first Reliability -point scale, similar to that used for ra 


cal 


In the uated an overall correlation was 


betwe f 
TI Cen the two ratings thus secured for every trait, ardless of any differences 
ged 0:712. Separate correla- 


Py (916), « Indivi .. A Study of Apperception.” Amer. J. Psych., VIL, pp. 350-393. Sharp, 

Айй. Viti niividual Psychology Apperception:” Method? Ibid, X, pp. 329-391. D 

10 ( 2-940 ee Study of Some Problems of Реге and a mus A 

Wi Oted fis ML. a also Hicks, Dawes (1914). ‘The Nature an 

on Mer (1992? Seashore, С. 

Th ш ), A naen СЕ, (1908), Elementary. Experiments in d Ch XT id IV 

Í ior c A3 "acti . Expert d iv. 
wchology : A Practical Manual. CIE En ; p iment ван 


in а ercept: 
aqq; рре Ptional Danaa : d 
dd “ception,” contribution in perceptions of equivoca: $ ч ) Burt used 

dividual differences at Liverpool. nd ied 


ition 1 

to 1 blo 2 Material t his own experiments on in 
the sym OS ШОЛ laken from Witmer and Sanford, ambiguous pictures and arti 
eq, the ayer tical п, ured and coloured), which he found in later experiments slightly Superos 
|, (orp tage vale blots of the kind used by Rorschach : oth the average reliability бш 
(whic 6stigag D. 381 ity (0-21) were low; but sex differences were easily demonstrated С^ dont 
S De cand the proc Brit. J. Educ. Psych. (1945), XV, PP: 116f). To control the presentation; 
äng tan be ada TOCOesses commonly involved he employed McDougall's © portable tachistoscope . 
this lat ра to the ordinary lantern for grou on). * subsequently, m MEM 
Roy. det, .CT Wor ee he started a more intensive study of ' the general apperceptive pro : 
Schacp S med the basis of the present scheme. _ 
plished Laboratory Notes on the 


hach oS OF t 
Т; е 
est (19). ре 00 are described and illustrated in unpub C refs 
г the psychographic schedule employed, see 20D Ў 
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tions were then calculated between the two ratings for the three largest groups, each rated by the 
same pair of judges. The figures obtained ranged from 0:450 to 0-855, and averaged 0-642. The 
general size and range of the reliability coefficients is thus not very different in the two cases. Since 
by mixing ratings secured from different judges the size of the coefficients is not significantly increased, 
we may assume that their standards were pretty much the same. In general, traits which are openly 
manifested in everyday intercourse, such as intelligence, sociability, intellectual and cultural interests, 


verbal ability, anxiety, and cheerfulness, were assessed with the highest reliability (about o ity 
iabili 


III. THE VALIDITY OF THE RORSCHACH RESULTS 


(a) Cognitive Traits. (i) General Intelligence. To secure a validating criterion 
for general intelligence, the scores in the two intelligence tests were reduced to 
standard measure, and averaged. These average scores were then correlated, first; 
with the judges’ ratings for intelligence, secondly, with the ratings for intelligence 
obtained from the Rorschach responses with Burt’s procedure, and thirdly, wit 
certain ‘categories,’ scored by Beck’s method, which are commonly claime 
orthodox Rorschach workers to be indicative of intelligence. 


The correlation between the intelligence tests and the judges’ ratings was, for 
the men and women together, 0-671. The correlation between the tests and the 
ratings given by Burt's content method was 0:765 (men, 0-738 ; women, 0-792). The 
correlations between the tests and the several Rorschach categories, scored by Beck's 
method, are set out below in Table I. In each case the coefficient was calculated by 
the full product-moment formula. 


TABLE 1. GENERAL INTELLIGENCE : CORRELATIONS BETWEEN INTELLIGENCE 
TESTS (CRITERION) AND RORSCHACH CATEGORIES 


CATEGORY CORRELATION 
Men Women 

Responses to Whole Blot (W% LE ie T — 060 062 
Large Details (D%) ue Ж e is + ۰102 052 
Large Unusual Details (D + Dd%) - = v :046 :056 
Good Form (F + 95) ·. 27 3 Бе n^ 457 -436 
Movement Responses (М9)  -- i v z 408 415 
Beck'sZ-score .. s e d E ix 221 403 
Modified Beck's Z-score iac M EE. 409 415 
Klopfers Form Level Score .. He ni p -500 +586 
Total Number of Responses .. md v. zm 055 053 
Animal Responses and Animal Detail (A + AD%) .. —:365 —:190 
Popular Responses (P%) s: 8 $ :248 —143 


1 Rorschach himself and most other Rorschach workers hold that there are no perceptible difference 
between the sexes, and therefore do not tabulate results from men and women separa ely. dren > 
and Moore, however, found marked sex-differences with their tests particularly among chil гсер- 
апа considered these to be analogous to the different sex interests reflected in other tests O аррег 
tion and association. Thus, in experiments carried out during 1914-18, “ where the o eather 
pictures are coloured, the girls were far more prompt to notice the fact than the boys; ОП е 38 a 
hand, the boys made more references to action. Among the boys there were near y three timpal 
many references to the war as among the girls. . .. Among adults, however except foro CUP. Dave 
and domestic interests, sex-differences were much smaller.” Most other workers wit inkblot 
also reported distinct sex-differences, at least among children of school age (cf. 4 and refs. 8, ifferent 
In my own groups, perhaps owing to their relatively small numbers perhaps owingtot 
cultural background, sex-differences in the Rorschach scores were in most instances СО rrer 
slight. It therefore seems legitimate to pool the results from the two sexes in computing the xes 
E тее there is any doubt on this point, I have tabulated coefficients for the two 8 
ately. 
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Coefficients 5 
of 0-255 and 0-330 mark the 5 per cent. significance level for the men and the 


wom 5 
Women respectively ; the signifi 
gnificance level for the average for both sexes is 0:196. It will be seen 


Hat the results оба 

Sive correlati 9 tained from the two sexes separately tei i 
qus percentage coder Ea both sexes, Баг; y tend tor conha E E NE 
е» aby the 209 ава саке of movement. None of these, however, approaches 

y th 1 ў 

aud Voc ed сат e. Traits. The judges’ ratings for Imagination, Verbal Capacity, 
па obtained by B ultural Interests were correlated with the ratings for the same 
Orrelated with zi urt’s method ; and the judges ratings for Imagination were 
wo Rorschach categories claimed to be indicative of this quality, 


namely. 

~J» percenta E 5 

: tage of ‘movement’ and of ‘form’ responses. (The Rorschach 
o traits.) The 


Scoring give: : 
Coefficients apr comparable to the ratings for the other twi 
erences Veen ya below in Table II. Since here there are no appreciable 
€ been pooled j e figures for the men and women, the results for the two sexes 
Ts into a single group. 
LE II, 
AND TOR ee TRAITS: CORRELATIONS 
CH RESULTS (a) WITH BURT'S METH 


BETWEEN JUDGES’ RATINGS 
OD, (6) WITH BECK’S METHOD 


B. Beck’s Method 


TRAITS | 
A. Content Method Percentage of Percentage of 
I Movement Form 
yy gination م‎ 
ба] Ability `` е 
Wise» v. i. unm 127 —212 
а ar 535 E = 


486 


the level of statistical signifi- 
h higher correlation than 
the results confirm the 


As 

© be 7 

(hace. as with a 
Stan Maginati 

ex da tion the content method of rating yields а muc 


Pectat; pr 
ti ‘ocedu : 
e Ss that an analy For verbal ability and intellectual interests it: ї 
thodox р ап Obvio ysis of content will provide a fair indication of these qualities ; and since 
it seems unfortunate to omit it (as the 


us M " 
component of the material obtained, 


Orschach 
Procedure does) from the inferences to be drawn from the test. 


TAR 
LE 
AND pl: OR s 
RORSCHACH C TRAITS: CORRELATIONS BETWEEN JUDGES’ RATINGS 
RESULTS (a) WITH BURT'S METHOD, (b) WITH BECK'S METHOD 
TRAIT 
A. Content В. Category 
е po Score Score 
Nera 
motional 
alit: , 
Version-Int f Percentage of Colour Responses (C%) 600 198 
ro 
Assert; و‎ Percentage of Movement minus +540 061 
“tion. А Colour Responses (M — C96 
heer Ubmission T € 
пе; s 
-De 4 
Anxigt ES Number of Form-Vista plus Flat Grey 519 082 
E s Responses (FV — FY. 
-138 
OCiabi; s Number of Form-Vista plus Flat Grey 624 | 
i ility Responses (FV — FY 
a ; 230 
‘ra Interest Number of Human Responses (H%) 479 
т — 
663 
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(b) Orectic Traits. We now turn to the estimates for affective and conative 
traits. Once again the ratings of the two judges for each student were averaged ; 
and the averages were then correlated with the Rorschach ratings as obtained by 
the two alternative procedures. The results are shown in Table III. 

The highest correlation with the Rorschach categories is that between the 
* number of human responses and the ratings for sociability. But the low figure 
(230) demonstrates that, even in this field, the Rorschach categories do not yiel 
very useful results. 


Matching Method. It is sometimes admitted by Rorschach workers that the individual 
Rorschach categories, taken by themselves, are of but little use for diagnostic purposes. 
Rather, they say, it is the total ‘ psychogram,’ the Gestalt, as it were, of all the categories, 
that alone will give a successful indication of the real Struktur of the individual's persona ity. 
To test this claim, the Matching Method provides the most appropriate statistical technique 
A verbal character sketch is compiled from each Rorschach protocol ; and independen 
judges are then asked to match the names of the subjects with the several sketches submitted. 


and insight of the judges ; and (iii) demand a good deal of experience and skill on the part back 
interpreter who interprets the Rorschach responses and writes the sketches. One further таун ‘has 
arises from the fact that only very small groups of subjects can be conveniently matched ; anda 
often been objected, on purely a priori grounds, that any small and heterogencous £ schach 
probably be matched with fair success, whether the personality sketches were based on Rorse 
responses, common observation, or indeed almost any relevant procedure. Thus, in mı о 
subsequent enquiries from the judges often showed that they had been frequently £ dec riter 
the sketch as a whole, but by some specific clue which had been inadvertently included by the 

and which was not really dependent on the test-results. 

In the present research the total number of subjects was split into small 
seven in each ; and personality sketches were written from the orthodox Rors¢! y 
The results gave high figures: by Burt's formula the average coefficient Was ` based 
Vernon’s the combined figure was -856. As a rough comparison, personality sketches rent 
on the scoring by content were later read to judges of four of the groups. Here the ee were 
was 1:000, i.e., every one of the subjects was matched correctly, although the sketche 
decidedly short, and dealt with the cognitive traits quite as much as the orectic. vas in the 

These figures, however, are not very trustworthy, partly owing to the difficulties iment. 
matching technique already noted, and partly on account of the conditions of the expe ntents 
For example, when writing up the sketches the investigator had already noted the СОД ona 
oftheresponses. It was therefore difficult to be sure that the sketches, nominally Don tion to 
synthesis of the Rorschach category scores, had not also been influenced by atten i 
the content. Probably a more reliable way of approaching this problem will 
Pd CU Pie US so as to obtain weighted sums for the m 1 

Д о measure ; an і Шет 
ао men for this purpose we must have recourse to 


to 
groups, five 
hach protocols. 


IV. A FACTOR ANALYSIS OF RORSCHACH 
CATEGORIES 


Correlations between Categories. As Burt has pointed out, with the wide Vased 
of Rorschach scores, two modes of factorial analysis are available—the first, tively 
on the more familiar method of * correlating traits, the second on the compara" е5^ 
novel method of ‘correlating persons.’ The latter, however, though highly € to 
tive, is not only less usual, but also more elaborate. and would require à раро ет 
itself. Here, therefore, I shall confine myself. chiefly to results obtaine i 
known procedure. 
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Tetrachoric correlations have been calculated between the main Rorschach 
Categories, thirty-six in all! The two intelligence tests were also included in the 
Series. If we accept the common assumption that the standard error of the tetra- 
choric coefficient is approximately 1:50 times the standard error of a product-moment 
Correlation, then we may regard coefficients above 0-300 as statistically significant. 
th A number of the correlations had minus signs. As a result, on summing the columns to extract 

e first factor, it was found that eight of the categories produced negative totals, namely, Popular 
WS (P%), Responses to Whole Blot (1%), Animal Responses (4%), Anatomy Responses 

:/* Æ), Responses to Form (F%), Beck's Z-score, the Modified Z-score, and, finally, the Average 
ime taken to make the first response. It would, of course, be quite impossible to start with a 
ipolar factor containing 29 positive and only 8 negative saturations. Accordingly, to secure 
Positive saturations throughout the signs for these seven categories were reversed. 


TABLE IV. FACTOR-SATURATIONS FOR RORSCHACH CATEGORIES AND 
INTELLIGENCE TESTS 


Category А. BIPOLAR FACTORS B. GROUP FACTORS 
Intelli I II Tey ee E С Р 
Intel ence Test (Verbal — .. ..| 267 516 —:152 | 040 590 
Form Los Test (Non-Verbaly — ..| 298 675 —0%5 Ol 259 
Good Fora More (Klopfer) ..  ..| 077  :805 —189 |— 2 — eo 
Movement (r) ^) 7 о 1| dej 07 gee |—259 800 
ewAnatomy CANozy a ND у 
Hum y (AN%) Бо a STORE OL 
Gui As Responses os, (RR IEEE ДОШ ys Л 
Architects Responses (4%) ҮШ 95. 340 143 451 346 
B MD n cto So о 3103 (Е 407 219 
(ПЫШ сою (ко Е 291 | 384 174 
bject our (FC) УЛ! 476-088 o 
Vista pro ReSbonses (Fo) |: |13 724 — 288 (—29 909 
Solour (C) imensional) 14 LE 554 364 281 0 284 
face a RU E 62 i 49 
апара (FY) s P of “630 m D | 32 2 
Stra, t "x E We AME Е 2 j D 
Clothing Bau. У Е A a 
Ора ROS e N соз S cay 
Uman etail sponses (P%) 4 -389 —.588 -—-132 E 518 
КЕБИР. or p eee E a ШЫ 
LV Sponses Oloured to Uncoloured Y^ 506 109 |—077 394 
Verg d Se ee "3 s =? їс” ^ ‹ 
оиа Srge Response Time.. ОУ = 125 d ‘068 
оет е. ek үш. (ЗББ OD O00 leo ig 
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Adeo queat 5n l| d 131 | 125 743 
Or ck) М5 261 2 TE 
Any Respon 37 124 | 545 Ad 
Nim; ns, m Ж m alit sd 
nal Detail; (9 Whole Blot (7%) | ‘615 m D 514 
Percent Details (py) ° - 535 036 | 307 
T CCntag ails (Dd) . es 075 90 tld 
Pla Number arie Details (08у |! 303 344 | $5] 375 
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thesi Corre] 3 

i at - 

` Table "m s itself is too large to reproduce here- 
> de Owing p. 223). 


It will be found in the Appendix to my 
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1. Bipolar Analysis. The correlation table was first factorized by Burt's Method of 
Simple Summation, using successive approximation to secure appropriate comming ae 2 
(16, рр. 461f.). Three factors, but no more, were found to be statistically sien Rer 
namely, one general factor, with positive saturations throughout, and two bipolar i oe 
(cf. Table IV A). As has frequently been insisted (16), to be convincing any psycho ОЕ, c 
interpretation of the statistical factors obtained in this way should, if possible, be suppo 


both by internal evidence and by the evidence of external criteria. It will be convenient 
examine the internal evidence first. 


If, as is claimed, the Rorschach test covers the entire personality, and if further we accepi iin 
usual broad analysis of personality into two main aspects, cognitive and orectic respective! a ity 
we might expect the two bipolar factors to represent general intelligence and general олоо indi- 
respectively, i.e., the one would contrast responses indicative of high intelligence with t dit with 
cative of low intelligence, and the other would contrast traits indicative of strong emotional Y 5 
those indicative of weak emotionality. As we shall see, owing apparently to the rather spec is 
diagnostic interests of those who have built up the method of scoring, this interpretatio. 6. ac 
quite accurate. Further, the presence of a comprehensive general factor would seem to be 5 | 
unexpected to those who have hitherto offered a ‘rationale’ of the test. 


2 У A itive 

(i) The first factor obtained with this procedure is a * general’ factor, having Po oy 
saturations ranging from 0-077 to 0-962. But, since so many traits have extrer 

saturations, it contributes only 29 per cent. to the total variance. in all 


From its very nature, a ‘ general’ factor denotes efficiency or favourable. performance here- 
the traits in the series from which it is obtained. A positive measurement for this factor TRorschach 
fore represent favourable performance in the Rorschach test, as scored by the orthodox chologic® 
method.! Can we give this factor any more precise interpretation in terms of familiar p d are 
characteristics? It cannot be identified with high general intelligence ; for the items 56 as well ; 
by no means restricted to cegnitive traits : they cover emotional and temperamental tro saturation 
and in some of the categories dullards may easily gain high scores. Moreover, the factor-se | 
for the two intelligence tests are comparatively low, less than 0:300. 


The highest saturations (over 0:8) are for Total Number of Responses, bk D of 
Unusual Details, and Number of Cultural Responses. These figures, and a o best bê 
the Б rations, both high and low, suggest that the factor can proba f 
interpreted as a general factor of fluency of productive association? 4 н су О 

This is, after all, not an ао Recent factor studies regard a apt 
productive association as the essential factor entering into what, in tradition? that earlier 
language, is termed imagination (cf. 26, pp. 180.) ; and it may be remembere 
workers used the inkblot test as a test of what (in terms of the old faculty PSY 
designated * imagination.’ It is so classified by Whipple ; and Binet, Dearborn 5 aginative 
Eum ns all considered that its most marked characteristic was to display `" 
ertility." 


>. put, 
; 

The category which gives the highest saturation of all is * Total Number of Response js lare 
although the number of responses given by subjects with a high measurement for this t obvious: 105 
nevertheless (as the other saturations show) they do not consist of the easiest, a repro 

most automatic types of response, such as are given by subjects possessing à хед Y^ Fesponses)” 
association (e.g., responses relating to animals, anatomical resemblances, or mere io ses. 
In the main they include a wide variety of imaginative, emotional, or cultural respon 


1 The fact that some of the items have been reversed does not really invalidate this stateme? г onse. 
as ordinarily scored, * average time ' denotes slow response ; when reversed it denotes quick Torker 
Similarly, what Beck scores as a large number of * popular? or stereotyped responses, ot 
would score in the reverse direction as a small number of * original’ responses, an! tal numo 

It should be noted, however, that many scores are expressed as percentages of the to ether thi 
of responses. Now we cannot expect all types of response to be equally available Ya sociali] 
total number of responses is small or large (for example, as Burt points out in dealing wit aple coul À 
responses, “ the proportion of monosyllabic words to total number of words in the saog entage, 
hardly remain the same, when the sample is small and when it is large "). Hence the D increas 
scores having low saturations must consist very largely of those characteristics that cann 
at the same rate as the total number. 


А rsch 
* It may perhaps be noted that other research students in the Department working with the Pictor of 
tests applied to English subjects, both adults and children, have also found a genera 
fluency of response,’ e.g., A. Petrie and S. Cox. 
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Rorschach" я җе 
ach's account of the ‘ dilated ° as contrasted with the * coarted ' type might seem in some 
ut this is not altogether true. The usual 


degree to 
t = 
Measure of the gn the same type of mental process: b 
Movement respo ilated ` type is a large score for * the sum of colour responses plus the sum of 
igh aatan тоз (cf. 20, p. 267) ; but these two categories do not, as à matter of fact, yield 
Not so much by 1 or this factor. Further, the * coarted ’ or “constricted” type is characterized, 
„There B ow fluency, as by “rigid intellectual control.” 
attributed by p to be a somewhat closer resemblance between this facto 
fai A a to his ‘eidetic’ type. In his discussion of eidetic imagery he suggests that 
arily selected ale operate on the higher levels of perception,” 
Sm à great extent n the total number of stimuli presented and the associations the: 
М ishes, interests a oT purely subjective factors such as meaning, values, consciou ur 
үш! the presen nd previous experience.” His two main types, the * integrates ' and * disintegrates,” 
the ing are in fae or absence of these selective tendencies. , The differences in the products of their 
ley have * rich 0 due to the way they select. Thus the ‘ integrates” are the nker: 
and ALE and * see meaningful wholes.’ The disintegrates ' are more objective 8 
*R analytic? It m and experience have little effect on their perceptions ; they are inflexible 
чепсу? would apud seem therefore that the subjects who gain high factor measurements for 
Gi) Th © in many ways analogous to what Jaensch calls the * integrate type.* 

" е h J 
bipolar fact ond factor contributes as much as 19 per cent. to the total variance. Itisa 
i e most ec bas the categories which have large positive or negative saturations are for 
хоме saturati ategories which relate to cognitive characteristics. Those obtaining high 
of. Ting for the S ns are Klopfer's Form Level Score (which was designed to “ get an objective 
ne Form E ESE functioning " (19)), Percentage of Good Form, and Low Percentage 
Буе y. Both the intelligence tests also yield high positive saturations. High 
Re; sponses Low Beck Z-score, Number of Bad 

ap pons NE Percentage of Responses to Wieole Blot, and Number of Animal 

е fa the items involving * details’ (e£. D%, Dd, AD, and * Human Details ) 
of Or represent: It would appear therefore that this 
Attention 2 Rd Burt has called the * * synthetic, Or * integrative ' type 
ontrasted with the * particularizing ` OF * analytic’ type. 

tual, if not always more intelli- 


Th 
Bent. e former t 
> thela Ype of mind is and intellec 
t is apt to be more abstract andi E i some respects 


Analo, ter ti 

(he 08045 to © concentrate rather on practical or isolated detail. 

Satu fines it Y Rt Rorschach, following Meumann calls the Wahrnehmungstypus or Erfassungstypus 
"but the characteristics emphasized by the factor- 


ations ar 


mode of у. 
eb perceptual approach °) ; 
һа (iii) The d no means the same as those enumera 
be in hig E d factor contributes only 7 per cent. to the total variance. The categories 
Ndicative Chie or negative saturations are (0 à large extent those commonly alleged to 

€ affective or emotional aspects of the subject's personality. 


ted by Rorschach. 


1 
Jaens 
ch, E 
з » E. R. (19 b pere 
ui Houa E 30). Eidetic Imagery, рр. 92f. 
М а ы H 3$ i 
арр е рѕ be pointed out that the word * attention ү dere used in he СК. 
nsciousness Pi p. 147f. of 3rd ей). 


cDougall 
c cf. Sto , namely, as the mode of co 
alumana a S the ^ Monal of Psychology, chapte 
«инр, Cistineti alizing? and *particularizing ^ (уре 
d ise iul 1 and between Pene oder konzentrative Aufmerksamkeit. and 
lateq «еа ti merksamkeit (Experimentelle Püdagogik, , L, рр. ‚500/. ). 
el, сс; a ease main forms of ‘imagination’: ап * abstracting type, 
tha buch eining” t etermining ’ type, which fills out given details to make а Че 
t thes; der Psych ype, which creatively combines details not given In пп. argues 
У Oe hologie, Y, pp. 470f.; Sully, The Human Mind, I, р. 364). Pium ae 
: erties of apperception or, as he prefers (0 52У, attention (op. cit-s 1, PP: á 
5 is tw ies in the 
1 s account of S Recount М 


descrip htrast wee to, 
ibi noted by Binet. Jt appears again 1P hi ‘ lich 
x B : an accessible Eng jf it may be 


ds i 
(s eu also 
f 
Adena in 


e 


B an object с a 
jecit late, Myers? ject ’ and ‘ describing a picture, V» 3: 58 
impe ORE consid Introduction to [сеске Peychology, pp. 12 £): MIS Si bs and the sub- 
fece tan Teflectiy, егей Binet’s distinction between “ the objective OF practica. esses its possible 
two nical o in edi type ” of special importance in vocational guidance ; urt stresset ted wit 

fold yok mod ucational guidance (sel ting pupils for grammat schools as COT of this 

ASSification Schools). Miss eee anf ain conclusion consisted in à con 
; and it reappears in Bartlett (7, p. 256)- 
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Burt, in attempting to assess emotional tendencies from the Rorschach test results, follows 
the usual principles described by British psychologists 1 in discussing the influence of feeling ОП 
jmagination and apperception : and lays stress chiefly on “ the concrete contents of the interpre- 
tation." “These,” he says, “ often reveal the dominant emotional characteristics of the more 
active appercipient interests : in an apprehensive mood the child is more likely to see ghosts, skulls, 
or corpses; in a cheerful mood, to see fairies, toys, and Christmas trees. What may be calle 

“hedonic ’ and ‘ valuational’ judgments are especially suggestive.” Rorschach and his followers 
lay more stress on certain relatively abstract categories ; their inferences are based on the allusions 
to ‘colour,’ ‘ movement,’ * shading,’ * white spaces,’ and the like ; and for the rest, they mainly 
rely on interpreting the ideas and images suggested as quasi-psychoanalytic symbols—the metho 

so often stigmatized as the ‘ dream-book mode of interpretation.’ 


In Table IV A, it will be seen, the two largest positive saturations relate to the number of 
responses depending on Colour (C% and C + CF); on the other hand, the number of response 
depending on Movement (M95) yields a negative saturation, though the coefficient is not large. а 
far this might suggest that the factor represents Rorschach's contrast between what he calls the ' ex i 
tensive’ and the ‘introversive’ types. Bryn gives a tabular list of the characteristics suppos io 
to distinguish these two types (23, p. 126) ; the chief characteristic of the * extratensive E sa 9 
be their‘ unstable emotional life.’ However, the highest positive saturation of all relates to ‘clouds of 
and there are fairly large positive saturations for items usually supposed to be characteristic is 
vague, introverted,’ and autistic thinking (e.g., vistas, landscapes, abstract responses, etc.). coms 
type of thinking is, of course, especially characteristic of the neurotic. And much the same tal 
junction of categories is reported by Guirdham, who found that a marked tendency to give iis to 
Responses often accompanies responses of the kind called ‘ Diffused Shading,’ e.g.. references on 
vague and relatively formless objects like * clouds? or ‘ mist.’ * Diethelm also noted a close relati 
between Colour Responses and Cloud Responses under the influence of adrenalin. 


All these considerations combine to suggest that this third factor tends chiefly to contrast 
subjects with neurotic and with лол-пеиғоїіс. tendencies respectively ; and this inference I?» 
as we shall see, confirmed by external evidence. 


Correlations between Factor-measurements and Independent Ratings. with & 
view to obtaining a more objective check on the interpretation of the various factors 
approximate factor-measurements were calculated for the bipolar analysis. 


iti i i i * thei ional elements 
1 writer who first laid stress on the influence of emotiona 
Among British psychologists the wri ‘atures of admiring assemblies anget 


on imaginative combinations " was Bain : “ vanity sets up pictu ves a character to û 
once roused, resuscitates objects in harmony with it; the emotion of terror gives ac naran titious " 
the ideas formed under its influence : ghosts and hobgoblins fill the imagination of the Баро гісі the 
(Senses and Intellect, рр. 614f.) ; indeed, in Mental Science, p. 175, he proposes to ion." Sully 
term ‘imagination’ to the productive process as differentiated by the presence 0 RA. process : 
similarly states “ the presence of feeling gives a particular direction to the imaginati 


every feeling tends to reinstate those images that are congruous with it” (op. cit., b. P: 


? Rorschach himself described the interpretation of ‘movement responses 2S apperception 


` problem in the entire experiment" (9, Eng. Trans., p. 216). Earlier work fly it 
had contrasted ‘colour,’ not with ‘movement,’ but with ‘form’: Rorschach, chien : 
seem for a priori reasons, proposed to substitute ‘movement.’ Burt, however, has gi ; se 
deal of evidence for maintaining that the change is not really justified (cf. 15, pP- i т.” This 
negative correlations are between interest in form and colour, not movement and colou: sre are 
is borne out by my own results. In the correlation table (27, appendix), it will be 90 corres 
small positive correlations between colour and movement, but comparatively high пе айу” egati? 
lations between colour and form. Similarly in Table V most of the Form responses have ation of 
saturations, and * Few Form Responses’ has a positive saturation. The highest согы js а150 
M% is with the matrix test (+ 0-70) ; and its correlation with the group test of intellige oomen 
one of the largest (+ 0:40). This agrees with the view of many Rorschach workers that 
Responses are indicative of intelligence. pos?) 
2 It should be noted that Rorschach’s ‘ introversive ' type is not (as many writers appear tO SP nany 
altogether the same as the more familiar ‘introverted ’ or ‘inhibited ’ type : the latter 


ways corresponds more to his * coarted ’ or ‘ constricted ° type. 1 psych 

1 Cf. Guirdham, A. (1936). * The diagnosis of depression by the Rorschach test. Brit. J, Mee 

XVI, pp. 130-145, and refs. ^ Test” 
ac 


5 Diethelm, O. (1934). * Personality Content in Relation to Graphology and the Rorsch 
Proc. Ass. Res. Nerv. Ment. Dis., XIV, pp. 278-286. 
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Original scores were reduced to standard measure, and then weighted with the 
Saturations for the three factors obtained by the bipolar method. 
. This device for calculating factor-measurements is exceedingly convenient in practice. But 
it perhaps needs a word or two in justification. As a check on its validity, more accurate weights 
were also calculated by Ledermann's formula, namely: ЕК”! = (F'U^F + I)iF'U^, where R = 
the correlation matrix, F — the matrix of factor-saturations, U — the diagonal matrix of specific 
Actor-saturations.! It was found that in their relative size they closely resembled that of the 
Corresponding factor-saturations : in other words, the post-multiplication of F' by К^! or its equiva- 
ent does not greatly affect the final result. The rank correlations between the two sets of figures 
are for the general factor -82, for the first bipolar factor -91, and for the second bipolar factor -90. 
he correlations between the factor-saturations and the factor-weights are surprisingly high, and 
Без! that comparatively little loss of accuracy results from the substitution of saturations for 
actor-weights. 

The approximate factor-measurements thus obtained were correlated (a) with 
the results of the intelligence tests and Cattell’s test of fluency, (b) with the average 
Of the judges’ ratings for certain personality traits, and (c) with the personality ratings 
às estimated by Burt’s procedure from the content of the Rorschach responses.* 

he correlations are shown in Table V. Only traits which provide at least one 
Saturation over 0:30 are here included. It will be seen from the various figures that 
the conclusions suggested by the external criteria fully confirm those based on 
Internal evidence, 


TABLE v, CORRELATIONS BETWEEN FACTOR-MEASUREMENTS AND (1) TESTS, 
Q A) JUDGES' ASSESSMENTS, (2 B) ASSESSMENTS BASED ON CONTENT OF 
RORSCHACH RESPONSES » 


EU." 2 001 RORSCHACH RESON c 


FACTOR 
TR. 
+ 1 и ш 
Bu 
1. Tests 
Intelligence (Matrix 132 507 —031 
Intelligence (Ма 3) » 032 456 002 
luency (Cattell) .. А 485 041 19: 
А B A B A B 
3 Assessments Judges Content Judges Content | Judges Content 
Dteligence .. — ,. — ..| 196 180 50 5385 000 —01 
егсерііо, ; xd -096 <a E ert 
Magination i а р 530 393 146 202. 092. 013 
general Emotionality — ..| 232 3n 156 186 112 188 
Xtraversion-Introversion .. | 149 307 169 BET 02 їз 
*urotic Tendencies e 171 +242 —212  —:319 d 
(ee ee Ei T Lu re e 


ofi ог the first factor the highest coefficient furnished by the tests is the correlation Dm oe к 
Correlation ene highest coefficient furnished by the Judges ratings mon 23 nd ЫШ 
With imagination : ral emotionality and extraversion- ‹ yield 
tests, figures, For the pseu factor the only large correlations are those with ео aren 
all the © two ratings for intelligence, and the content rating for perception o S Fes Significant 
Correlati" er correlations are statistically non-significant. The third factor Jem IDE 
the as With one trait only, namely, neurotic tendencies, as assessed both by the judg 
Sessment made by the investigator. 


1 
Leder xy 
imati tors by Regression. 
Bohomeryi W. (1939). ‘A Shortened Method of Estimating Mental ‘Factor ÛJ os a 
'Ci 


a, IV, pp. 109-116. Cf. Burt, C. (1937). ° Correlations 
p. 85. 


гп e i f the subjects 
during" ЧОН I also attempted a twofold classification, based on my observes classification Ex 
петер е „course of the tests and interviews. The correlations dom V ies 

‘Serial coefficients, and probably less trustworthy than the other figures. 


: 33 


The Rorschach Test 


2. Group Factor Analysis. A supplementary analysis was also carried out by 
Burt’s Group Factor Method (16, pp. 477f.). The original matrix of correlations 
was divided into three submatrices in accordance with the lines of division suggested 
by the bipolar analysis ; and a basic factor and three group factors were then 
extracted. The saturations are shown in Table IV B. 


() The basic factor (F) places the categories in much the same order as did the general 
factor in the bipolar analysis, although the saturations have a wider range. The correlation 
between the two columns of saturations is 0-89. Hence it seems reasonable to regard the 
basic factor as representing fluency of association. ]t may perhaps be identified with 
Hargreaves' group factor of * fluency’ (derived from inkblot and other tests) : he defines it 
as ‘ quantity of imaginative production,” and considers that it confirms “ the discovery О 

an Imagination factor by the authors of A Study in Vocational Guidance."* 


It may be seen that a few of the categories have small negative saturations for this factor, 
namely, Good Form (F + 95), Klopfer's Form Level Score, Form Responses generally (F), Movement 
(M%), Ratio of Coloured to Uncoloured Responses, and low Modified Z-score, as wellas the Matrix 
test of intelligence. This probably implies that the large saturations assigned to the broad group 
factor (G), which covers these traits, really include saturations due to a narrower and more specialize! 
supplementary factor, which, owing to the small size of the sample, cannot be significantly dis- 
tinguished. 
(ii) Of the three group factors the first (G) includes those traits that have positive satura- 
tions for the first bipolar factor and negative for the second. The persons belonging to this 
group tend naturally to generalize (they make free use of broad intellectual concepts), and 
are free from any marked neurotic tendencies. They thus show stable and well-organize 
personalities. The factor itself is perhaps best interpreted as a factor for integrated thinking, 


closely allied to intelligence. 
Categories which are generally recog 


as well as Movemen' 
saturations for this factor ; 


nized as denoting intelligence (Klopfer's FL, F + 96; eco 
gh 


ble estimate of intelligence as ordinarily understood. е Пау 
ther subdivide into two subgroups. 


(iii) The second of the group factors (Е) includes categories, like Colour and Few Form 


Responses, which have positive saturations for both the first and the second bipolar factors- 
According to the orthodox interpretations, it should therefore denote a factor of emotionality- 
i ion, the characteristics having 


Except for * Clothing,’ which has a comparative! i 
de not only * Colour 


high saturations in 1 ) 
and *Few Form' Responses, but Abstractions, Landscapes, Vistas, ‘Surface Grey —аге 
highly suggestive of the type which Burt terms the * unstable introvert,' a type whose emotion- 
ality is of the vague, dreamy, reflective kind. Burt, for example, points out that, wit 


1 Op. cit. sup., p. 24; cf. Burt ef al., A Study in Vocational Guidance, pp. 43-46, and 26, p- 181 


Burt and other writers interpreted the underlying ability in terms of a positive endowment | from 
is implied by such terms as ‘active imagination, ' activity " of intelligence as distinguished 5 i 
“level? (Binet), ‘cleverness ° (Garnett), and the contrast drawn between * imagination ' ап fans 
(Coleridge and Wordsworth, cited by Burt)  Hargreaves, however, finds nothing correspon! of 
to this positive conception, and believes fluency of association to depend essentially upon Ја may 
inhibition : he compares it to the * flight of ideas ’ in intoxicated and manic states; This, it AU 

be remarked, comes near to Rorschach’s account of * dilation * as contrasted with * coarctatioD- ў 


* It will be noted that the Rorschach test, as ordinarily scored, gives little or no information ао, 
special aptitudes. Some light is thrown on these by the scores derived from an analysis of CO 0110; 
Miss Hillman, in an unpublished study with children, considered that verbal and practical Е diat? 
as well as productive and reproductive imagery (the latter including imagery an imme ce t 
and delayed memory), might all be tentatively assessed, but with no very high reliability, “oye! 
perhaps in the case of productive imagination, “which the inkblot material was originally emp 


to test.” 


3 Cf, Burt, С. (1917). * The Unstable Child,’ Child Study, X, p. 70 (Section on * Association ? 


34 


AMYA SEN 


тсе tests, the * unstable introvert ° prefers “ landscapes to portraits, scenes from dream- 
forth rather than from reality a (15). These and other preferences suggest a close parallel 
e kind of fanciful pictures which persons of this type tend to see in the inkblots. 
| en paths last of the group factors (P) covers all those traits that showed negative saturations 
iens irst bipolar factor. The subjects with high factor-measurements for this factor show 
E ency towards particularization, in the sense described above : their responses are 
e almost naively, by ,non-constructive mechanical association. This factor 
am tore seems to indicate a dominance of processes akin to what Burt calls * reproductive 
association 7 and older writers * passive imagination." 
that uc arrangement of positive and negative saturations in the second bipolar factor suggests 
GS mo naits entering into this group factor should also be subdivided. They appear to imply 
E inguishable subtypes, namely, a more objective or practical type, who interpret quite 
ү E N the objective of concrete details which they see in the blots, and a mildly neurotic 
on thi e interpret the details in a more personal or subjective way. Burt lays considerable stress 
1 INS atter tendency, and states that it “ appears to occur much more frequently (a) in the 
Бе а and (b) those persons of either sex who are of a psychoneurotic type and would often 
tendencies d as suffering from * inferiority ° or * superiority * complexes or even mild paranoid 
ambi аа Alike in tests of free association, іп the interpretation of pictures, and in responses to 
ten ett Pictures and meaningless spot patterns, inkblots, and the like, they evince a marked 
emis y to give individual associations and interpret things in terms of their own personal experi- 
distinct; more than half their replies are characterized by reminiscence or self-reference.” The 
In his ion appeared in Binet’s experiments, and was most clearly marked in his tests of description. 
«û his account of the results obtained from his two daughters, Marguerite (M) and Armande (A), 
the rs are told, “ writes much more frequently of her own past experiences than her sister : 
ES ence is not so much due to memory as to interest. So, too, the words used by M refer 
nye her own person." 1 
the eee with the present figures this distinction cannot be pressed. Within this set of traits 
Є restare [Шаг factor shows only two large positive and two or three large negative saturations ; 
1 egree of SERIE small. It is therefore possible that these figures merely point to a certain 
Or three j overlapping, or possibly to a couple of small special factors linking no more than two 
ее items in each case. 
Gd the foregoing factors had been extracted, there still remained a few large residuals 
Süpgest 1219 а definite overlapping between certain categories. The more conspicuous 
Colon relatively specific correlations between * Percentage of Colour Responses ° and (a) 
S CET plus Colour-and-Form Responses’ and (b) * Few Whole Responses,’ and again 
TH Colour plus Colour Form Responses ' and * Modified Z-score.’ ? 
may to © foregoing factor analyses help us to see far more clearly what it is precisely that we 
test qui extent infer from Rorschach scores. Although it is claimed that the Rorschach 
Many pu me whole personality, it is obvious that it fails to give any clear information about 
en ois features. Indeed, it would seem that assessments based on it, particularly 
| Single ng in the usual way, are bound to be somewhat limited in value, first, because no 
^H Personalit and no single type of test-material can really do justice to all aspects of the 
^ i Y ; secondly, because the conventional scoring has evidently been appreciably 
| 
| 


ias та; p 
ed by the special interests of those who have developed the test. Rorschach himself and 


most 5 E $ 
of the more enthusiastic champions of his test have been chiefly interested in the 


1 
Myers? 
here i nt Introduction to Experimenta 
Of artistic a? it may be noted, are analogous to t 
li y be CIE d HOD. Burt believes that, to a large extent, responses 2 
ike (cf. H ified in much the same way as responses made in interpreting pictures, colours, and the 
owever, fou, the Mind Works, chapter on ‘ The Psychology of Art, pp. 280f.). Miss Hillman, 
pa gest ан that the correlations obtained, though usually positive, were seldom high : the 
еріп ations were obtained for the * objective ' and the ‘associative’ types. This seems to 

8 with the inferences drawn above. 

With 


nm К 
а larg, x рав approximate saturations have been inserted to deal with the overlapping cases. 
to fit the ṣo P they could probably have been established more clearly by rotating the bipolar factors 
© 90р factor pattern. Professor Burt, however, has suggested that an even more instructive 

ral stiu. n might be secured if the group factor pattern used as a guide was based upon the 
I; arches i ctural pattern which has gradually emerged from the accumulated results of factor 
Ments for th the past. This, however, would seem to entail a larger number of independent assess- 

€ chief cognitive and orectic factors. 
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hose observed by Bullough and others in their studies 
s made in interpreting inkblots 
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psychiatric field : hence their scoring has largely been concerned with the diagnosis of those 
forms of defective intelligence that are met with in the feeble-minded or the mentally diseased, 
and of the milder types of emotional instability that characterize the psychoneurotic and the 
psychopathic. ү 
The factor analysis shows that some of the interpretations are undoubtedly justifiable, 
even if their reliability and validity are not very great. Thus the kinds of response said to 
distinctive of high intelligence are, on the whole, accurately assigned. On the other hand, 
many of the assignments proved to be ill-founded or even fanciful. The suggestion that 
* M9" denotes ‘imagination’ is plainly negatived by its low saturation for factor I (0:097). 
The suggestion that FY denotes “ lack of activity and listlessness"" seems negatived by its 
high saturation for factor I. The statement that “Total R” is “the best indication O 
intelligence ” seems contradicted by its negative saturation for factor II. The outstanding 
importance attached to C%, M%, and their sum and their ratio, is hardly warranted by the 
_figures. And the reader who cares to compare the saturations for the several categories 
with their orthodox interpretations as given in the Rorschach guides will have little difficulty. 
in discovering other instances where the usual diagnostic inferences can claim no measure О 
verification. Indeed, the large number of low saturations scattered throughout the table 
seems to suggest that many of the scoring-categories fail to confirm each other in the way 
that is claimed. 
_ On the other hand, assessments based on analysis of contents seem far more promising. 
It is possible that a more eclectic type of scoring, based both on content and on the more 
suggestive of the formal characteristics, might, if subjected to a factorial study, render the 
test still more valuable for practical work. No doubt further studies on a much wider 
scale are required to substantiate the conclusions I have drawn. But the results so far 


obtained seem plainly to. demonstiate the value of correlational studies along the lines | 


here described. 


Introspective Evidence. lt is a sound principle that inferences as to the nature both of 
tests and of factors should be supported, not only by internal statistical evidence and by 
external observation, but also by introspective data, elicited from the subjects tested, an 
explaining the processes by which their overt responses were reached. Many Rorschach 
workers, after administering the test itself, conclude with a brief supplementary * enquiry. 
But this is usually directed to features in the blot, not to the mental processes of the subject. 
Its object is “ to ascertain what part of the blot,suggested the response, and whether it was 
determined by colour, shading, etc." : “ questions should be asked only to clarify the scoring : 
enquiry into most responses is uninstructive and superfluous " (20, pp. 97-98). In research 
this restriction seems an unwise policy ; and, even in clinical work, where time is limited, it 1S 
not without serious risks.* » 

“ Earlier investigations," it has been said, “ on problems of this kind made the introspections 
the central feature of the experiment. In using indeterminate material (inkblots, spot-patterns; 
incomplete or ambiguous puzzle pictures, and the familiar alternating diagrams) the primary object 
of the investigators was to discover how, during the attention process, a relatively meaningless 
stimulus acquires meaning for the percipient. Trained most probably on Titchener's four-volume 
Manual, they would have learnt, from his very first experiment on attention (reporting on what i 
seen in puzzle pictures), that * the appeal lies to introspection : . . - here, as elsewhere, the verdic 
of introspection must outweigh all preconceived opinions. è Since then, however, owing argely 
to the reactionary criticisms of the behaviourist school, introspection has fallen into neglect. То! d 
often the most characteristic responses are explained forthwith in terms of a small stock of allege 
mechanisms, of which the percipient is supposed to be unconscious, when, in point of fact, he himse 
may be fully aware of the real reasons for his replies. Most frequently, of course, conscious 4! 

1 Miss Hillman, for example, relates how a psychiatrist at a child guidance clinic, having applied 0 
Rorschach test to a boy of thirteen, reported that "the bizarre responses (illustrated in detail! 
provided strong evidence for a diagnosis of latent schizophrenia ” ; “whereas a subsequent introspec; 
tive study of the child's mental processes showed that “seven out of the eight * bizarre response 
were words that the child had been writing out over and over again in a spelling-exercise give 
that morning in school.” In other instances memories of quite recent incidents, especially at tid 
cinema, ‘ sufficiently explained a number of the children’s more peculiar replies, which WO 
otherwise have been quite inexplicable, and were dropped entirely when the test was repeated. Я 

2 Titchener, E. B. Experimental Psychology: A Manual of Laboratory Practice. 1, Qualitative» 
Students’ Manual, p. 109. 
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unconscious processes both play their part. But the former are nowadays too rarely investigated 
In attempts at validating and interpreting tests " (10). 1 
Many of the responses given by my own subjects could only be interpreted 
correctly by one who had already ascertained the particular features in their cultural 
background which so often prompted their replies! (cf. 27, рр. 33f., 93). Since the 
majority were untrained in introspection, the evidence available from this source 
Was somewhat limited. Nor is a statistical journal the appropriate place for a 
detailed discussion of introspective studies. It may, however, be suggested that in 
future. investigations the following points could usefully be borne in mind when 
ae some of the questions raised by the correlational data and the factorial 
ults, 
As Titchener says, “ introspection knows nothing of faculties.” Hence, so long 
as we think of factors as abilities, we may gain little help. “ What introspection , 
Knows are processes.” Now, however meaningless the stimulus, the mental processes 
It arouses will inevitably be affected by “ the tendency of everything psychical to 
Secure meaning.” ® And “ apperception,” as Stout reminds us, “is the process 
MY ereby à presentation acquires significance for thought," or, in a word, ** secures 
Meaning,” 3 Introspection thus seems essential if we are to obtain a first-hand 
гере of the various forms this process may take. With the blots, for example, 
oe Subjects will explain how the “ meaning” nearly „always presents itself 
mediately, after the briefest glance; others report that it only comes after an 
appreciable interval spent on what Bartlett would call “ rummaging about, 
pm ‘ching among ancient memories now half forgotten. We have therefore to 
an cise between factors operating in and during the apperceptive process itself, 
and Ose that operate outside it or before it ; and between those that are conscious, 
con lake time to evolve, and those that are more or less unconscious, and may 
Кс cquently operate almost at once. Whether clearly conscious or not, all such 
co cesses will have a threefold aspect—cognitive, affective, and conative. The 
Rene Teport, for example, may show that the subject in question nearly always 
am the name comes first, and he then develops the implications of the name ; other 
Oed Will say that they first picture some related incident or object, and experience 
name erable difficulty in finding suitable words or names. But what suggests the 
tone S, the incidents, or the objects? Оп the affective side, an accompanying hedonic 
АШЫ always to be present, though it тау not be mentioned until an pi 
enor IS requested. If, as we are told, attention to dark shading commonly 
toned S an unpleasantly toned * anxiety,’ and attention to light shading a piam. y 
be abl euphoria,’ introspection should, with trained psychological students а cae 
5 ШЕ. confirm it. On the conative side, lack of fluency may or may not n ue 
“е Ort Itions, or to the fact that the subject has to make what Bartlett cal | а 
to dis after meaning.” Free association and other analytic techniques may help 
predict nee the deeper and less conscious influences. t may, І think, be fairly 
i Cd that even the simplest experiments along these lines will often throw more 
Une. € factors actually at work than either speculative ier ue a mas 
generaj pained statistical data. Only, therefore, when we fully un E ап е 
їпазуіу, Р8УСћоІору of the Rorschach test shall we be able to use it scient y 
Val assessments (10). г 


1 
to acde,, Venture to suggest, would seem to imply that, in child guidance work, it is highly inadvisable 
are oft interpretations Of children’s тере, Dn forward by psychologists or pes em 
Sven рро, tite unfamiliar with the daily life of the children in question and may S OE 
D Tea, ras up in another country. It certainly confirms the аш 0 EDU et UN 
ndon sius Ports by psychologists of German origin on English schoolc! 

Wald n A 
х Analy W. Prinzipien der Denkpsychologie, p. 619. 
© Psychology, TI, p. 110. 
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V. SUMMARY AND CONCLUSIONS 


1. A hundred Indian students were given the individual Rorschach test, two 
intelligence tests, and Cattell’s test of fluency. The Rorschach test was scored, first, 
by Beck’s standard method, and, secondly, by Burt’s method of rating the contents 
of the responses in accordance with a schedule of personality traits, such as 15 
commonly used in individual psychology. Ratings for the various traits were à 50 
obtained from independent judges, who were close acquaintances of the subjects 
tested ; and these were used as validating criteria. 


2. Correlations were calculated between the judges’ assessments and (a) the 
individual Rorschach categories (classified according to Beck’s scheme), and (b) the 
ratings based on an analysis of content. With few exceptions, the Rorschach 
categories, as scored by the orthodox procedure, gave figures that were statistically 
non-significant. On the other hand, the method of rating by content furnishe 
consistently high coefficients. 


3. Correlations between the 36 Rorschach categories were factorized by 
(a) simple summation and (b) the group factor method. The interpretation of the 
factors obtained was based partly on the internal evidence of the saturations them- 
selves, partly on the external evidence supplied by independent assessments, an 
partly on introspective evidence obtained during the supplementary enquiry. , ДО, 
bipolar analysis indicated, first, a general factor of fluency, and, secondly, two bipolar 
factors concerned with:cognitive and with emotional traits respectively. The first 
of the two bipolar factors appeared to contrast responses of a synthetic or generalizing 
type with those of an analytic or particularizing type; the second appeared to 
contrast neurotic with non-neurotic tendencies. The group factor analysis suggeste 
a basic factor of fluency, and three group factors which might be interpreted as 
representing very approximately general intelligence (an integrative or generalizing 
factor), general emotionality (with a marked bias towards introversion), an 
relatively mechanical tendency towards the associative reproduction of particulars. 


4. To provide a check on the inferences drawn from the factor-saturations, 
approximate factor-measurements were calculated for each factor; and these Wed 
correlated with (a) the results of independent tests, (b) judges’ ratings for the mon 
relevant traits, and (c) assessments based on an analysis of the content of the response $ 
On the whole the correlations appeared to corroborate the tentative identification 
of the several factors. 


. 5. The results of the factor analysis confirmed several of the orthodox interp 
tions regularly suggested for some of the commoner types of response, particu 
for those alleged to be indicative of intelligence and of neurotic tendencies. 6 
of the more formal categories, however, appeared to have little or no general psyo* ng 
logical significance ; and the inferences commonly drawn from responses ree 
to colour, movement, and the like seem to rest more on preconceived hypothe 
than on verifiable evidence. It is concluded that an analysis of content, ^ 11У 
on generally recognized psychological principles, and expressed іп terms of genera 
recognized psychological factors, would be far more valuable for practical purposes 
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GROUP FACTOR ANALYSIS 


By CYRIL BURT 
University College, London 


I, General Problem. II. Non-Overlapping Group Factors: A. Three Group Factors: 
B. More than Three Group Factors. C. Two Group Factors. D. One Group Factor: 
Ш. Overlapping Group Factors. IV. Summary and Conclusions. 


I. GENERAL PROBLEM 


Aim. In the following paper my main purpose will be to state the essential 
principles underlying what I have called the * group factor method ’ of factor analysis 
and to describe the simplest working procedures available for the commoner typ 
of problem that occur in actual practice. This will give me an opportunity of replying 
more fully to criticisms of the method raised by earlier writers, and of answering 
relevant questions received from recent correspondents. At the same time I 5f E 
endeavour to explain in what ways the underlying concepts resemble, and difie 
from, those involved in the related notions of ‘ simple structure ’ and ‘ bifactor analysis. 
The fact that the recent edition of Professor Thomson's book now includes 2 
* Addendum ° on the ‘ bifactor method ’ has aroused a special interest among Ви 
workers ; and has led to several enquiries about its relations to previous procedures 
which I shall attempt to answer so far as my own limited experience of it permits. 


Need for a Group Factor Method : (1) With Physical Measurements. In several 
previous contributions I have described how the notion of factorizing a correlat! " 
table into group factors originally arose (7, 8, 9, and refs). Here I need only recaP 
tulate the history of the concept quite briefly. 


is 

The need for some such procedure was encountered first in connexion with the analys | 
of physical measurements. To make the problem clear, let us begin with an image tne 
instance. Suppose we have taken four representative measurements for the size O A 
human body (e.g., height, weight, chest breadth, and arm length) and three for the 512 
the head (e.g., length, breadth, and height); suppose, further, that we have foun tions 
positive inter-correlations between the first four traits, and large positive inter-correla first 
between the last three traits, but zero or non-significant cross-correlations between Шш wo 
four and the last three : in such a case we should, I imagine, conclude that there wer 
distinct and separate growth-tendencies at work—one for the growth of the body 
another for the growth of the head. torial 

So obvious a conclusion would be drawn regardless of any thought or theory of fae theif 
analysis: it would have been unhesitatingly inferred by Galton, Pearson, or any 0 e for 
followers half a century ago. It is, however, convenient to have a more general num 


es 
* Factorial Analysis of Human Ability (1948), pp. 341f. In this country the bifactor method 5 д0 
almost unknown. The only large-scale enquiry in which it has (to my knowledge) been use E ‘ook 
unpublished research on * Temperamental Factors, recently carried out by Mr. С. Ј jl acl 
(Ph.D. Thesis, University of London, 1937). Thurstone, in his recent work (Multiple 
Analysis (1947), p. viii), complains that Holzinger and Harman “ dispose of the [his] simples on 
concept in a footnote” (10, p. 102); but he himself dismisses the bifactor method wit er 
parenthetical sentence in his preface. In my own view, as I have said elsewhere (7, 9), the 
methods put forward by different writers should not be regarded as rival competitors : eerste 
have its special merit for the purpose for which it was contrived. Hence I hope it will be UD rke 
that my criticisms relate primarily to the particular kind of problem on which my fellow" 
and I have been principally engaged. 
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Such hypothetical tendencies ; we may therefore agree to call them ‘ factors.’ And, since 
€y are factors that enter into limited groups of traits only, it is natural to refer to them 
as * group factors." 

If, as psychologists, we are interested more particularly in the growth of the head or skull, 
We might go on to enquire which of the three head measurements, or better still what 
Weighted sums of all three measurements, would give the best indication of skull-growth, 
and indirectly, we might suppose, of brain-growth. By means of such weights we could 
obtain explicit * factor-measurements ’ for our * group factor.’ 

Now, when we turn to an actual table of such correlations, for example, the table for 
Physical measurements discussed by Pearson and Macdonell in 1902, the pattern discovered 
15 Not so clear cut: we find that, while the inter-correlations within the two main 
groups are both positive and large (averaging 0-67), the cross-correlations are by no means 
negligible ; though relatively small (averaging only 0-27), they are all positive and all 
Significant. Thus, as Pearson expresses it, both ‘like’ and ‘unlike’ traits are positively 
torrelated, although the correlations between ‘like’ traits are much higher. Evidently 
WO independent and non-overlapping group factors will not suffice to explain the figures 
actually observed. | 
m Pearson himself, it will be remembered, suggested extracting as many uncorrelated 
a (or ‘index characters,’ to use his own term) as there are correlated traits, and 

Sosing for that purpose the ‘ principal axes’ of the correlation ellipsoid. In that case 
Bor Ctors except the first would necessarily be bipolar. However, on examining that 
= ‘on of his table which gave the cross-correlations, it was observed that, curiously 
for ugh, the figures were almost exactly * synclinal’ or ‘ hierarchical,’ i.e., they virtually 
qucd an oblong submatrix of rank one (see 12, Table І, p. 104). It therefore seemed 
would o nical to use seven factors to explain these particular figures when one factor alone 

Serve. And in fact a very close fit to the whole table was obtained with three factors 
One * general’ and two ‘ group’ (ibid., р. 118). 


ob 2) With Mental Tests. Similar correlation patterns were from time to time 
Soa in the early results obtained for mental traits. But it was not until about 
that the idea of group factors became widely accepted by psychologists. 


In ту first intelli i for “ f allied tests to 

с paper on intelligence tests (1909) І found evidence for “ groups of alli S 
Sig tate together un thug after ШЕ ERIS effects of general intelligence, there still remained 
сіта! Ог nearly significant residuals for the inter-correlations (а) between tests of sensory dis- 

Bet (b) between tests of motor co-ordination, (c) between sensori-motor speed tests, and 
results won tests and assessments involving memory (1, pp. 144, 164, and Tables V and үй нее 
Broups Be Confirmed and extended in an investigation carried out with Mr. Moore on ҮШ arge г 

ESSA Schoolchildren, tested by means of group tests, which included higher menta DIOC. d 
14 tests ent corroboration seemed to be afforded by Simpson's work with adults. Simpson app E 
In my о representing different cognitive levels (similar to the five * levels of mental proces E 1 
` certain a investigation), to 37 adult males ; he also found, in addition to general mental abi ity 
0 thos, relatively Specialized capacities ' belonging to particular categories or groups (very simila: 

associar already reported), namely, (a) sensory discrimination, (b) motor control, (c) memory 

howe Cation, (d) quickness and accuracy of perception, and (e) selective thinking. Simpson, 
"b Was his inferences merely on a comparison orhe овсун coefficients, and made no 

eliminate the general factor or test the residuals for significance. ў i^ 
he Obie Conclusions Ses] at first strongly criticized by Spearman and others. Not without duum 
сопу Sd» first, that the statistical techniques employed were crude and inconclusos an 
Judged Y that the numbers tested were too small for the results to be accepted as fully Seas 
9f over Loo € intercolumnar criterion, he found that my own data gave a mean Poe S GNE 
Bi > à 
measu "Ка, 1, 1901-02, pp. 177-207. In this enquiry the dimensions of body and head actually \ 
the RUE. ere not quite the same as in the hypothetical example ] have taken in the tex 1 9180 
by Scota gators had to choose their figures from the Bertillon scheme of measurements Collecte 
У and Yard from criminals. 


S А " 
Ix, S on Psychological Tests of Educable Capacity, Board of Education, 1924. In Appendix 
both a „Majority 


i i isi E hesis including 
of i Ч ting, at least provisionally, а hypot 

Seem General factor aa Mc iE factors. A review of the chief group, 26018 RU 
176-199 Pe fairly well established will be found in Brit. J. Educ. Psych, XIX, 1949, pp. y 
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writers at that date, he doubted whether the distinction between ‘like’ and ‘unlike’ traits, SO 
conspicuous in the case of physical measurements, could be expected to hold good in a field so 
diferent as that of mental testing, “ unless the *similar" tests were so similar as to be virtually 
identical. 

American criticisms were less emphatic. Freeman, for example, observed that ** Simpson presents 
his results in the form of a table similar to that which Burt reported, and the order of the tests bears 
out the same conclusion which was drawn from Burt's ” : nevertheless. he had ** doubts as to the 
validity of the classification " (^ according to mental categories "), largely because “ the correlation 
between traits in different classes is often much closer than the correlation between traits in the 
same class. . . . If Spearman's two-factor theory is correct, special capacities do not fall into groups.” * 


(3) With Educational Abilities. Conclusive evidence for the existence of such 
* group factors’ was first obtained in assessments of educational abilities. Here 
written tests could be employed ; and data could consequently be secured from samples 
large enough to render the first residuals fully significant and the second residuals 
definitely non-significant—at least in the earlier studies. 


To estimate the correlation of each trait or test with the hypothetical factors, a method of 
* simple summation ’ was employed, based originally on the assumption that the * observed average’ 
of a given set of correlations could be identified with the * true average.’ In dealing with residual 
correlations left after the effects of the first or general factor had been eliminated, two courses seemed 
open : ** we may look either (a) for limited group factors, shared only by similar sets of tests, whose 
influence will be solely positive (the natural assumption in analysing intellectual abilities), or (b) for 
supplementary general factors, ambivalent in their nature, which will account not only for specific 
resemblances but also for specific antagonisms (an assumption that appears perhaps more plausible 
in analysing emotional and temperamental differences). Moreover, these alternative aims will, as à 
matter of fact, affect our conception of the general factor, and the way in which we estimate it. . . - 
When we come to the secóndary factors, with assumption (a) the obvious working-procedure will be 
to take the significant positive correlations  (i.e., the clusters of positive residuals) '* in subgroups, 
and factorize these just as we should complete tables." * 

It was by this latter procedure (* Method a’) that, working in collaboration first with Mr. R. C. 
Moore and later with Miss M. Bickersteth (3), I obtained what seemed to be unquestionable evidence 
for the presence of special cognitive abilities in both mental and scholastic tests—abilities of a kin 
which at that time Spearman and others strongly denied on the ground that they resemble: 
the old-fashioned ‘faculties.’ The first explicit confirmation of our results came from one О 
Spearman's own pupils, Miss N. Carey, who based her deductions on school marks. “ With actua; 
tests of school abilities," she writes, '' Mr. Burt has found? a * general educational ability, 
combined with * specific educational abilities " (arithmetical, linguistic, literary, and manual), very 
similar to the combination of the * hypothetical general factor * with subordinate * group factors 
that he found in his intelligence tests ” ; and she points out that her correlations for school mark$ 
indicate much the same combination of three types of factor, namely, in addition to (i) the ‘ general 


1 Spearman's most detailed discussion of Simpson’s result is to be found in his Abilities of Mat 
(6, pp. 145f.). He there applied his later ' tetrad-difference criterion’ to Simpson's correlatio? 
table (it was the only large table for which this criterion was worked out in detail) ; and conclude! 4 
that the resulting graph “ displays one of the most striking agreements between observation an 
theory ” [the theory of a single common factor] * ever recorded in psychology * + jn a footnote 
however, he admitted that there are “ some very few cases . . . more than five times the pro a 

error ” (and a still larger number, it might be added, more than three times the probable error) 


Freeman's criticisms, embodying earlier comments, will be found in his Mental Tests (1926), PP: ls 8 
з Annual Report of the Psychologist to the L.C.C. (1914-15), Appendix I : cf. 7, p. 306 and рр. 62-68 
below. Here I should perhaps explain that the phrases used to designate the two alternative pror 
cedures have varied considerably. The full titles should be * summational analysis into genera, 


bipolar factors’ and * summational analysis into general (or basic) and group factors >; bu 


in 7, p. 306, and 8, p. 356) the two alternatives were simply described as ' Method a’ and * Metho M. 


respectively. Later, ' Method a’ was called ‘analysis by submatrices.’ Nowadays the procede by, 


` s Miss Carey is referring to results reported in the Annual Керо! i c. 0912 

i SEI rt of the Psychologist to the L.C.C- 2 

(briefly summarized in 3, pp. 34-37). Appendix I of ihe Rep of j LAM ША account obs) 
derivation of the formule (afterwards included in Mental and Scholastic Tests, 2nd ed., PP- 211- 
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and (ii) the * specific factors,’ (iii) a series of * group factors ° : (in her own data, these last comprised 


а verbal factor for verbal Subjects, a motor factor for handwork subjects, and probably a third 
factor for informational subjects). To simplify the computations, however, she proposed to modify 
my procedure by assuming that the residuals on which any one group factor is based are approxi- 
mately all the same in size. Incidentally she observes that her results are in marked conflict with 
the assumptions made by Spearman and Hart in examining my earlier paper, namely, that the high 
intercolumnar correlations obtained from tables like my own commonly demonstrate that there is 
no such clustering of high coefficients as the presence of group factors would imply. She herself 
actually obtains negative correlations between columns when these relate to tests that contain 
different group factors and low g-saturations ; and she uses this result to justify the way she has 

Partitioned her table. 
B In a number of later researches and theses, the group factor method was regularly employed. 
ut for the most part the working procedures adopted were improvised to meet the special needs 
Of each particular enquiry ; and, as the interest lay rather in the psychological results obtained than 
In the mathematical devices used, little trouble was taken at the time to establish them systematically 
9n a comprehensive basis. Indeed, during the 1920s, it would have been commercially impossible 
in this country to publish a book containing numerous correlational formulz and expensive tables 
except with the assistance of the London County Council; and the Council, quite properly, was 
More concerned to publish accounts of the practical conclusions reached than discussions of statistical 
Procedures, Asa result, during and since the war, younger psychologists have tended to substitute 
methucthods described in the more accessible American publications, particularly the * centroid 
i5 ace and the device of ‘simple structure.’ Nevertheless, it has apparently proved difficult 
requiro ee altogether with the * general’ or ‘basic’ factor, as ‘simple structure “Seems to 
actora And, partly for this reason no doubt, many requests have been recently received. fora 
illustrat method which would take such a factor into account. However, before attempting to 
Precisely in detail the working procedures suggested, it will be advisable to formulate a little more 
ely the underlying hypothesis and the fundamental prin@iples on which all of them are based. 


it а Hypothesis Involved. The early work I have just described seemed to make 
in th е ium more evident that, in all researches on individual psychology and especially 
eee eld of Cognitive and scholastic testing, a possible interpretation to be kept 
р Paty In mind was what I have termed the ‘ group factor hypothesis, The 
tested. 15 convenient, but curtailed. I use it to denote the hypothesis that the traits 
into or measured can best be analysed into (i) a basic or general factor, entering 
EE all the tests, and alone accounting for the cross-correlations between the several 

ups, and (ii) a set of group factors, either overlapping or non-overlapping, each 


1 
теу, N. (1916). Brit, J. Psych., VIII, p. 180. Miss Carey's procedure departs in one or two minor 
Wi ım my own. In previous discussions both Pearson and Spearman had criticized the 
definitign’ Proposed to deduce, from restricted parts of the table only, a general factor which by 
cargar? (0 cover the whole table. ў 
Should ы 9n insisted (and I think rightly, so far as a rigorous proof is concerned) that the first step 
a whol to show that significant residuals were left, even when the closest possible fit to the table 
Calculated” had been obtained (i.e., to use factorial terminology, that the general factor should be 
Furth, from the entire table by the method of least squares, or some equivalent procedure). 
that the ig argued (as mathematical critics of current factorial methods are still inclined to do) 
discover Сайіпр diagonal should contain unity, not reduced self-correlations. When we try to 
More ind € minimum self-correlations from unrepeated tests, we run the risk of trying to deduce 
doubly dicent variables than our table permits, and we further render any valid test of significance 
Pear; ie» ч E 
€g., tea VAN insisted that th 1 fact ht rather to be based on some external criterion, 
сүз te, ` e general factor oug e 
erani ers assessments or selected tests used as reference values ; and he considered that a true 
Herding, $3Ould show equal decrements, not rows of coefficients diminishing in proportion. 
Or her tos y, Since Miss Carey was venturing to criticize Spearman's own conclusions, it was necessary 
each P^ his general procedure. Thus, in calculating the saturation coefficients ( the correla- 
NOt fro, School subject with g”), she used Spearman's method, which derived the ‘saturation 
Чазигев 010 Cable of observed inter-correlations, but from two special tests assumed to give pure 
a о tow. 5$. In her main argument, too, she assumed that the hierarchy would be ad bu 
Bd 182). ards the end of her paper she is inclined to adopt my own procedure (cf. pp. 
erno; З 
ап arden? for example, obs, * in spite of the fact that no Navy or Army psychologist was 
Senerar fe disciple of ES а SOL dnd over again our test inter-correlations have yielded a 
Patt tw thre Tunning through all the tests and covering 30 to 40 per cent. of the variance, together 
"Sonne] » IR 9r more group factors ” (‘ Statistical Methods in the Selection of Navy and Army 
` “oy. Stat. Soc., УШ, 1946, p. 145). 
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of which is primarily limited to a specific cluster or group of tests (presumably 
measuring * like” or * similar” mental functions), and in all of which the significant 
saturations are essentially positive. 

Jn this hypothesis there was nothing new except the attempt to define the relevant concepts 
with sufficient precision to permit of rigorous statistical verification. Some such scheme of factors 
is already implied in the traditional attempt to classify mental faculties or functions into comprehensive 
genera and subordinate species. It is still more clearly implied in Galton’s doctrine that an individual's 
achievements in a particular sphere may be explained in terms of some * specific aptitude * for thal 
sphere of work, superposed upon a * general ability * required in every sphere. 

A Working Procedure: General Principles. Having defined our terms, ош 
next task is to deduce the verifiable corollaries of this particular hypothesis, an 
see how, in any concrete case, its consequences can be tested by means of the data 
actually obtained. Where more than one factor was concerned in any given test, 
it was assumed that, whether cognitive or conative, ‘ general,’ ° basic,’ * bipolar, 
or‘ group,’ the factors would combine (like velocities and forces in ordinary dynamics 
in accordance with the ‘ parallelogram law.? From this it follows that, if the 
factors postulated are independent and uncorrelated, the correlation between any 
two tests, rzy say, could be expressed as a product-sum of the factor-weights 0 
* saturations,’ i.e., 


Tey = Fly F Гаус К...» ) 
where the hypothetical correlations between the tests and the factors (xb, ly» ete 
are taken as weights (see Appendix, p. 72, equation iv). This in turn implies 
that any observed correlation matrix is regarded as analysable into a sum of supef 
posed ‘hierarchies’ (i.e., matrices of rank one), each of which may cover eithe 
all the traits tested or only some of the traits tested (8, p. 344). 

As applied to the group factor hypothesis, these assumptions lead to 7 50 
convenient requirements which any working procedure should fulfil. (i) First, di 
far as possible, all the significant residuals (on which the supplementary factors Wa 3 
to be based) should be positive, never negative." (ii) Secondly, “ all the non-signiice e 
residuals should for any given test (i.e., for any given row or column of correlat! 
where only one factor was involved) sum to zero." 


Each of these conditions requires a comment. d 10 

(i) To most teachers and educationists, it would certainly appear clumsy and far-fetche ich 
classify cognitive traits in bipolar terms, and to postulate ‘ abilities,’ or at any rate ‘factors, "oth 
have about as many negative saturations as positive. Indeed, with a few notable exceptions hers 
logicians and scientists have commonly been averse from dichotomous classifications W en mo^ 
are available.^ Accordingly, in researches like those I have described, it seemed simpler an tively, 
intelligible to postulate positive group factors for verbal, arithmetical, and manual tests тере ра}, 
(in accordance with ‘ Method a’), instead of classifying the various performances first into, s erbal 
or ‘ non-verbal,’ and then subdividing the ‘ non-verbal ’ (and sometimes quite illogically the en with 
into ‘manual’ and ‘non-manual,’ and so on (as required by * Method 5. No doubt, ece 
educational abilities the occurrence of an occasional residual with a negative sign is not of n 
inexplicable. But it should be regarded as exceptional. he clos 

(ii) If we desire to judge ‘ goodness of fit’ by the principle of least squares, then t оп wi 
fit will be obtained by taking a weighted sum of the non-significant residuals : for then our прога! 
be to minimize the square-sum of all the non-significant discrepancies. This entails a rather E arpo 
process of successive approximation. In practice, however, it appeared sufficient for mos р ted. 
to substitute the principle of ‘simple (i.e, unweighted) summation, similar to that à 


calculating an ordinary average or a mean deviation. This yields a direct and Si 


o two 


procedure for obtaining factors of the type required : (see Appendix, pp. 72-13 below). «df 
1 This assumption was first put forward in attempting to analyse the * general factors aP: was ost 


* Jevons, F. B. (1900). Princi ^ Sci z 
Hoste ah aoe ). Principles of Science, ch. XXX ; Joseph, H. W. B. (1931 
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Accordingly, let us now consider what are the most conyenient procedures 
or factorizing a given correlation table along these lines, and how we can best 
test the ‘ goodness of fit’ furnished by factors obeying the scheme described. 
We may take the simplest form of the hypothesis first of all, namely, that which 
assumes that there is no overlap between the various group factors. As we shall 
sce, the double principle just described leads to slightly different formule and slightly 
ifferent working methods according to the number of group factors required. 


II. NON-OVERLAPPING GROUP FACTORS 


Partitioning the Correlation Table. The device on which the proposed procedure 
pends is that of ‘condensation.’ After the observed table of correlations has 
een appropriately rearranged and partitioned, in accordance with the grouping of 
€ tests, the various figures or ‘ elements’ in each submatrix are summed, so as to 
reduce the entire matrix of observed correlations to one of lower order., Provided 
* groups show no overlapping, this can then be treated as an approximate matrix 
of rank one, except for those sums which now appear in the diagonal) But, before 
We can proceed to the actual calculation of the group factor saturations, we must 
first decide how the original table is to be partitioned, in other words, how the tests 
9r traits are to be grouped. - a 
his b Omson, in an appendix on the bifactor method, added to the last edition of 

0 


Ok, mentions three alternative suggestions. But one or two others may also 


be put forward. à 


` (1) Subjective Classification.—Probably the first and most natural suggestion would be that 
ae grouping should be ne 19 E ET by considering the nature of each test, and putting 
Bether the memory tests, or tests involving number, and so оп” (11, p.341). As numerous theses 
is a favourite method with the youthful investigator. Yet it seems highly inadvisable. It 
Prov; too easy a way of appearing to establish the very hypothesis the investigator. hopes to 
S € purpose of factorization is to allow the analysis to reveal, quite EE and 
ei Y, What are the actual groupings. If, however, we start by assuming (say) that al gu 
of АГУ tests form one group and all our number tests another, then we shall be basing the met es 
on the very conclusion which the method of calculation саш ао ШУ to к 
i Coefficient ing. Izinger prefers to calculate a * coefficient of belonging. i 
to 4 Somewhat оваа ле o E and, as ordinarily used, it does not always dead 
сос que subdivision. Most investigators seldom calculate every possible value for the CLIE 
Preconceivagind it is then all too easy to stop when the results correspond with the investigator’s 
esires.? 

ion). Correlati i i ak h or profile of each row of correla- 
t relation Pro, iethod is to make a graph or p. 
by T Brouping those Ke e a rm profiles. This procedure has been most strongly pavo А 
Whole ш; К. homson states that he himself prefers to compare, not the contour of ihe prol е ese 
Of the Ut Only the peaks of each row.” The device undoubtedly throws light on the ae 
1 Correlation table 4 ; but, in many cases if not in most, it does not yield a very satisfactory 


Gee, тето Short-cuts depend upon this general principle, e.g., the * sum method ’ of bipolar analysis 
t or со с ХатРІе, this Journal, II, i, p. 61). e 
hypothe’ It is possible that the whole research may have been planned to UE part Es 
15, invo Ving a specified grouping of the traits. But in that case it is essen a eho ae 
fit, Oe ath Actor matrix reached on this hypothesis is the only hypothesis that gives $ аср 
be ne Gace gives a better fit than any other hypothesis. In the simpler type of proble: 

EE UN e 


is h Simply by showing that a bipolar analysis leads automatically to the grouping assumed. 
rese $ has Ь 


ү 2 ; a 
arches een the experience of a good many workers in this country, em d EXEAT m 
thesis 4» q PE of the most recent and convincing set of illustrations wi 


3 i i i i hope may shortly 
be Publishes analysis of Tests of Manual Dexterity (Univ. London Library) which I hop y 


н, 
determin, hom zanda I suggested graphing correlations es ihe тале UN. MES 
3 ү ын: n inal’ ( hierarchical ’) o: ] 2 2 
Btaphesttelations. "ЕР A from Karl Pearson, who еН эчер ар аца 
too lep 5 Procedure presents an interesting picture of the relations suggest iip 
“ngthy and inaccurate to form a regular part of the routine proc 2 
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way of partitioning the table, because the profiles and the peaks of the observed correlations may 
be affected both by the general factor and by the errors of sampling. 

(4) Intercolumnar Correlations. Spearman, it will be remembered, in contrasting the older 
hypothesis of * multifocal levels? with his own * unifocal” hypothesis, suggested, as а crucial test, 
the value of the correlation between the various columns of observed correlations. “ When the 
columns belong to [tests from] different levels, the high coefficients would always come in non- 
corresponding places, so that the correlation between columns is negative " (Brit. J. Psych., V» 
p. 57 ; cf. 6, p. 139). With my own table, which he reprints, he finds the (uncorrected) intercolumnar 
correlation to be, for the four largest cases, + 0°81, for the five smallest + 0:59. Не therefore 
infers that, contrary to my own contention, the table gives no evidence whatever for grou 
Miss Carey, however, has pointed out that, unless the amount of correlation due to the group factors 
is large and the amount due to the general factor comparatively small, the intercolumnar coefficients 
will not be negative, but only reduced in size. In practice therefore, particularly if there 15 some 
measure of overlapping, the criterion may often prove somewhat inconclusive. 

(5) Preliminary Bipolar Analysis. The method which to my mind is by far the best for general 
purposes is to start with an ordinary factor analysis by simple summation, and then to take the 
positive and negative sections of the bipolar factors (provided their saturations are significant) ej 
indicating where the lines of partition are to be drawn. This procedure is entirely automatic an 
objective. In certain cases a preliminary analysis of this kind may be dispensed with : when the 
correlations have been rearranged so as to bring clusters of high coefficients as near to the diagon* 
as possible, it will sometimes be found that the lines of classification are sufficiently clear wit Ha 
further check. But such cases are exceptional : whenever there is the smallest room for doub 
an initial bipolar analysis is in my view indispensable. 


The Addition and Multiplication Procedures. In the group factor method, as compared 
with other forms, the chief novelty arises from the way the first or ‘ general ° factor hasto be 
computed. Since it is not precisely the same as the * general ' factor reached by ordinary 
* simple summation,’ it is perhaps better to give it a distinctive name. As I have state 
elsewhere, the underlying aim is to “ seek a general factor which will represent, not UY 
average slope or plane of the correlations, but the basic slope” (7, р. 308). Accordingly: 
we may call the factor so obtained the * basic? factor. Our discussion will therefore 
concerned chiefly with the different devices available for calculating this factor. Я t 
With the group factor hypothesis, as we have seen, the essential assumption 15 that. 
all the factors except the first are limited (save for a sporadic overlap) to groups О en 
tests ; and that in consequence those parts of the correlation table which relate to ‘unli 
tests will form parts of a * hierarchy’ (matrix of rank one), since they are due solely to 5! 
single basic factor (except for occasional overlaps which we have agreed to consider late 
Now, in an ideal case, where the correlations in this portion of the table are ris, 
hierarchical, it is easy to show? that the fundamental equation given above yie ds ish 
alternative procedures for calculating the saturations, which in such a case will furn! 
precisely the same figures. The first requires us to add the row-totals ; the secon 
multiply them. it 
The multiplication method leads to a formula which may at first sight seem simpler 5, and y 
forms the basis of the revised form of Holzinger's * bifactor method.'? Ор the other hanc, if 


ere 
1 To save space I shall not give algebraic proofs for all the formule implied by the procedures Ptr 
described. The more important are outlined in the Appendix below, and are discussed in 8 mer 


detail in the, roneo'd Laboratory Notes on Factor Analysis : Group Factor Method. There ilustra 
exposition gives, for each procedure, (i) the algebraic derivation of the method ; (ii) a simp’ licati” 
tion, based on an artificial correlation table, exemplifying the essential principle ; (iii) an арр peti 
to an empirical correlation table. From the discussion and references below, the interested ria 
will, I imagine, have little difficulty in supplying more details for himself, and finding ар 

examples in the literature. 


2 Holzinger, K. J. (1937). Student Manual of Factor Analysis, pp. 13f. As orig Я 
the * bifactor method” assumed that the common factors could e “thought of as split inept 
proportional parts, one of which is the principal factor.” As a consequence, the satura ion 
given for the secondary factors in the illustrative tables were always exactly proportion? ports 
corresponding coefficients for the principal factor: cf. Holzinger, K. J. (1935), Preliminary -on wa? 
on the Spearman-Holzinger Unitary Trait Study, No. 5, p. 6 and table on p. 2. This conceP 4 to 
originally put forward by Spearman, who sometimes called it a ‘ bifid factor.’ It 15 
several of Spearman's students, e.g., Hargreaves and Stephenson, but was not, "i 
adopted in any of their investigations. The earliest published reference to it is in the арре theo 
H. L. Hargreaves’ monograph, where he discusses the alternative forms which Spearm? 
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tru: Taen 
a en experience, what I have called the addition method appears preferable in actual 
First. here рУ ын memorandum two reasons were briefly mentioned for this preference. 
appreciably fie observed coefficients (owing it may be to large fluctuations due to sampling) depart 
effect op ee a strict hierarchical arrangement, there the multiplication method is apt to exaggerate 
оге е departures : in certain cases it may even give saturations over unity ; in others, 
Zero. Secondi е sums to be multiplied is nearly zero, it may give saturations that are too near to 
View of the Aine RIOR method does not lend itself to the usual type of check. In 
Ee 1 hat have recently been raised, these points perhaps deserve exemplification 


A. THREE GROUP FACTORS 


M remp. May I begin with what is at once the simplest and the commonest 
Worked n давау that in which there are three group factors? Most of the 
artificial SH ез given to illustrate this or allied procedures are based on tables of 
with Noe ations which give exact results, In such cases the figures obtained 
Чеге ME $ method and my own are likely to be identical. To illustrate the 
Tom the А. is therefore desirable to choose a table that deviates to some extent 
let us M SE hierarchical figures. To keep the working as simple as possible, 
REE able involving two tests only in each group (Table I). As usual, the chief 

Y arises, not over the calculation of the group factors, but over the extraction 


of the bas; : T 
group meee I shall therefore omit the square submatrices affected by the 


T. 
ABLE I. CALCULATION OF SATURATIONS FOR THE BASIC FACTOR 
Matrix of Observed Correlations, Partitioned and Condensed 


Test 
—_ Ч! 2 Sum 3 4 Sum 5 6 Sum 
1 
2 E S - 40 26 66 44 30 74 
н = 55 38 93 ۰05 02 07 
Шик. 
= == 95 64 159(4) | 49 32 ۰81(8) 
3 
40 355 95 жери. = 18 — 40 28 
E 26 -38 -:64 — — — 11 :09 :20 
m t * 
| 66 -93 1:59(A) — — — 29 9 "48(C) 
6 “44 05 49 18 ‘11 29 — — — 
ES ——— СО 02 32 10 09 19 = = E 
Bus ТА +07 -81(B) | 28 20 "48(C) E d = 
ot; 
Div al | 1-40 1-00 2-40 123 84 2.07 378 5 1:29 
ors | —— =) с: 25 
BOW 1:4652 2:1325 2:6087 
Sat (Add. 
at. (Muli) 15 6825 | 16380 | -5768 :3939 | 9707 | 22990 1955 | 4945 
:0087 ‘3682 2- 5731 -3975 = 2989 1956| — 


if nev ith both th 


i Si 
in the tj otal 


he work is considerably facilitated 
Iculated at the outset, and entered 
s and for the 


Ё foi addition and the multiplication procedure t 
of ob: Т the row-sections and column-sections are ca s 
ofa Gs Served correlations as shown above; the totals for the submatrice 
Poung al fa : $ 
into Ф and Sores might take : among the conditions under which 
Steppe СТУ test 1491 аз a single general factor,” one is that in which 
J. Ps, CNSOn’g < ii exactly the same proportion ” (Brit. J. Mon. Sup., х, 

Dh. XXx nanalysed abilities,’ due to * non-fractional factors, 

» 1937, pp. 100-102). 


** a general factor can be com- 
“two (or more) factors enter 
1927, p. 66 : author's italics). 
are of the same type (Brit. 
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entire columns can be calculated from these, and entered at the same time. This method of setting 
out such tables was suggested in the working instructions for my own procedure (e.g., Table VIII, 
Factors of the Mind, p. 478) ; but it can also be adopted to illustrate the method proposed by 
Holzinger (cf., for example, Thomson, 11, table on p. 339). 

1. The Addition Method. As in applying the principle of simple summation to bipolar 
analysis, so here the addition method requires us, after adding each column, to divide their 
totals by an appropriate divisor. The quotients will then give us the saturation coefficients 
for the first factor. With group factor analysis, however, we have to omit the diagona 
submatrices, since these depend on group factors as well as on the gefieral factor. This 
omission will naturally entail a corresponding modification in the mode of calculating 
the divisors. Instead of one divisor for all the column totals (as in bipolar analysis) we 5 а 
have to use as many different divisors as there are omitted submatrices—in the present 
example, three. й 

Let us use the letters A, B, and С to denote the subtotals of the three non-diagonal 
submatrices, as indicated in the table. Then, as may be easily shown (see Appendix, р. 73), 
for the first two columns (i.e., for the coefficients entering into the submatrices A an! 


the formula for the divisor will be Jc {үз г A5) -Ac is . It will be 


observed that the figures required for the terms inside the braces are the sums of those 
submatrices whose column totals are to be divided. Thus, in the case of the two columns 


comprising submatrices A and B, the divisor will be A 48 | 19 + M E = 1:4652. 


Similarly, for the next twa columns, it will be Al “81 { Ly | i5 | = 21325; and 


for the last two, A 1-59 W ы + 43 } = 2-6087. On dividing each column total bY 
the appropriate divisor as just computed, we obtain the first row of saturations shown m 
the last line but one. 709 
The theoretical matrix of rank one, to be fitted to the matrix of observed correlation 
is obtained in the usual way by multiplying each saturation coefficient by all the rest. T 
is analogous to the device first proposed by Pearson for fitting a theoretical matrix ofr 10 
one to an observed ‘ contingency table’ in cases of * manifold association ' (cf. Yule (191 ly 
Introduction to Statistics, p. 64, eq. 1, and refs.). To test goodness of fit we may acco! nod 
adopt the chi-squared procedure, employed by Pearson for such cases. The res! ш 
are found by subtracting each theoretical correlation from the corresponding O his 
correlation ; then squared, summed, and multiplied by the number of persons tested. tor 
was the method adopted in earlier investigations both with bipolar and with group fac 
analysis. However, in the latter case we need to locate the pairs of tests which appeal ve 
indicate an overlap of factors. Hence for most purposes it is simpler and more offer et? 
to test each residual separately by the standard error of the observed correlation. If te 
is no overlap, the residuals in the three side submatrices should all be non-significan e als 
Checks—(i) With this procedure, as a little algebra will show," each column of residu 
ire 
1 The only significance test that could be rigorously defended on statistical grounds would red ood 
the theoretical values to be fitted by the method of least squares. We could then derive a НКе n 
ratio from the products of the factor variances or latent roots. But the requisite formula ied 
I fancy, be too complex for practical use. Mr. J. R. Robertson has, at my suggestion: apers 
the simpler significance-tests to artificial tables constructed by using Tippett’s random nu corte 
and finds those described in the text to be reasonably satisfactory, provided the observer s cd 
lations are not large and the number of persons tested (N) is not small. When the О сае! 
correlations are large, I have recommended one or other of two methods of correction. E 
investigations the residuals were augmented by using Yule’s formula for partial correlation and 
later investigations, carried out after 1925, we have commonly used Fisher's z-transformatios сей 
multiplied the square-sum by (N — 3) instead of N. (This method of testing discrepancies {hots 0 
correlations has also been used for more general purposes, e.g., by Tippett himse^» А 
Statistics, p. 144.) 


з Proofs of all these checks are given in Laboratory Notes cited above. 
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(excluding, of course, the residuals in the diagonal submatrices) should add up to zero. We can thus 
Ee the same kind of check that we use in other forms of summational analysis. By way of 
ration I have calculated, in Table II below, the residuals for the first column. 


TABLE IL. ADDITION METHOD. CHECKING THE RESIDUALS 


Tests Correlations | Residuals 

Observed Calculated | 

епа аше Ж 40 5511 | = 151 

C^. ае 26 -3764 | — 1164 

5 | 44 :2857 + +1543 

6 | 30 -1868 | + +1132 

Sum | 1:40 1-4000 | -0000 
| | 


satu The Second check is also analogous to that used in simple summation. (a) If we add all the 
lock. hà except those for a particular block, their sum should be equal to the divisor for that 
With the je the Sum of the first four saturations is 1:6380 4- :9707 E 2:6087, which is identical 
Instead. of Е Изо (see Table I). (b) This check may be put in a slightly different form. If, 
equal to th adding the subtotals for the saturations, we multiply them, their product should be 
1590 е total of the submatrix common to the two blocks involved. Thus 1:6380 х -9707 = 
2 Similarly with the other sets of saturations. 

reduces е Multiplication Method. With only three factors the multiplication method 
9 à procedure ! which, at first sight, seems even simpler than the addition method. 


them ustead of adding the row- (or column-) sums of the separate submatrices, we multiply 
(66 4. э for test 1 (which enters into submatrices 4 and B), instead of taking the sum 
= 1-40, and then dividing it by a divisor calculated as above, we take the 
mainin those figures (-66 x -74) = -4884, and divide this product by the sum of the 

В submatrix (С), namely, -48. The saturation coefficient is the square root of this 
У(:66 x DE calculation of the several saturations thus proceeds as follows. Test 1, 


= 8] — En 48 = 1:0087 ; test 2, /(-93 х -07) + -48 = -3682 ; test 3, /(95 x 28) 
; etc. 
and We note (1) that the saturation for the first test here rises to over 1:0000, an impossible figure, 


i 
Spei, that the saturation for the second test falls to an unduly low figure, :3682. Generally 
to b an Will be seen that, wherever owing to errors of sampling the observed coefficients happen 
enlarge (nies (or too small) for a precise hierarchical fit, the process of multiplication tends to 
leithe: 9 diminish) the resulting saturation to an exaggerated degree. 


ч of the checks described above is applicable here. Thus the residuals for the first test 


a 
Те as follows 


TABLE ш. MULTIPLICATION METHOD. CHECKING THE RESIDUALS 


Tests Correlations | Residuals 
Observed Calculated 
1 
E | 40 578 | — 178 
1 | 26 401 | — 141 
44 -302 + 138 
З 30 197 | +103 
RON Ls | 1-40 1-478 | — 078 
1 
Holz; 
three Zinger LU : k Г 
, group 1. think, does not explicitly illustrate how his own method is to be used in the case о 
escribo factors only. "e sos peed аА Сай be readily deduced from his erm 
NE be, 0 OW) for the case of more than three factors; but with three factors only, the actual 


Б mes greatly simplified. See Holzinger's Student Manual, p. 26. 
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It will also be noted that the saturations given by my own procedure yield a much closer fit. 
The mean discrepancy (i.e., the average residual) is 092 with the addition method, and +107 with the 


multiplication method. 
B. MORE THAN THREE GROUP FACTORS 

1. The Addition Method. The principle described in section A1 above can readily 
be extended to the case in which more than three group factors are required. But, 
as in factorizing individual correlations, so in factorizing the sums obtained by 
condensation: whenever there are more than three given figures to be factorized, 
ie. when we are concerned with a square matrix of an order larger than 3 x 3, we 
must insert estimated values for the missing cells in the leading diagonal. р 

As an illustrative example let us take the following table. To avoid introducing 
large decimal fractions, which would not only take up undue space but also be harder 
to follow, I have chosen artificial figures ; and the values for the * observed ° correla- 
tions are made to differ very slightly from those which would have been obtaine 
by multiplying the saturation coefficients assumed when constructing the table. 

As before, we first rearrange, partition, and condense the matrix of individual correlations 
(Table IV). The row- and column-sums, and the totals for each column, are then calculated, 
as for Table I. 


TABLE IV. CALCULATION OF SATURATIONS FOR THE BASIC FACTOR 
- Matrix of Observed Correlations Partitioned and Condensed 


| | se | | | 
| | | 
| ^ (a) | | (6) | | (с) | (d) 
Test 1 2 fsan | 3 DR CU 6 | on | 7 8 un 
1 excel crieve css bg pua 36 ја |. 27. 18 || 
2 o ыст АЕ 71:059 40 320) 7200. 3 46 | 39 
or E SO TOT 223,| 784 “68 1:52 | -50 34 | “84 
| [т EC 
3 ШОКОТ pq 27 | 61 at 14 a 
4 Ж ө озу om m Bp. 241 «54. | 18; HS 
| m 
(b) Sum emme тоз gere —c 65 sr |0016 | 39 5 | 9 
| | | کے‎ 
5 ао EM "16 10. | 28 
6 | 036 -32 | -68 | 27 74 s Mss ed Т2 09 2 
|. 1 | | ER 
SEED [1 X2 54 |r EE ee = 
() Sum ..| 30 72 (152 | -62 554 | 116 | — | EE 
7 озо отага үзә] 16 32 ME 29 
8 | 18 116 | са 41 | 5. | do 309: 519-1 = aie Se 
«dj Sum) SS | 84 | 35 29 | 64 | 26 21 | T Келте 
Grand Total | 243 246 | 4:59 | 217 1:86 | 403 | 1-75 1-40 | 315 | 117 78 12 
К ل‎ | RGB US {= SZ 2 
Divisors ... | 270 | | 310 | | 350 | | 3:90 | 
Saturations 90 -80 | 1-70 | °70 -60 |130 | 50 -40 | -90 | 330 720 50 
| | | | | 
eT a 
We have now to factorize the sums of the submatrices. This involves factorizi ted 
condensed 4 X 4 matrix instead of the original 8 x 8 matrix. The sums, already cale Ар 
1 In describing my procedure Thomson observes that “ Burt only gives his formula for three коой 


(11, p. 341). However, in my paper on ‘ Factor Analysis by Submatrices,’ the use of the 7 cade! 
with more than three groups was briefly described End illustrated (8. pp. 350f., section Бел), 

More than three group factors," and equation iv, p. 352; cf. also 9, p. 482, note in Sm? pis 
It had previously been employed in one or two L.C.C. investigations and in several ЧОР 
theses by research students. 
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uet Endy of Table IV, are for convenience set out afresh in Table V. To factorize Table V 
ferrets summation, we must first complete the table by inserting suitable figures in the 
calculate. VE. of course, should be the squares of the saturations we are about to 
approximati ut, as we do not yet know the saturations, we must proceed by successive 
within ganon ; in this way we shall ultimately reach the figures shown in the diagonal 
manner packets: . The calculation of the saturations is then accomplished in the usual 
Noted ates ividing each column total by the square root of the grand total. (It will be 
89 — MES COME inserted in brackets agree with the squares of the saturations so found : 
etc. 


TABLE V. ADDITION METHOD 
Calculation of Divisors from Condensed Correlation Matrix 


Group " Sum of Sib @ Total 
8 Q89 223 1-52 84 748 
(c) 223 (1-69) 1:16 64 572 
er m 1:52 1-16 (81) 47 390 
= ue is 84 64 47 (25) 220 
ivi 
E isor 44 7-48 572. 3:96 220 - 19:36 = 44 
als. for Sums 
ушш $$ жу P. 


L 

Then thes call the * saturations for sums 7 Sa, S», Se, and Sa respectively, and their total T. 
S; is base ivisor for the columns on which Sa is based will be T — Sa ; for those on which 
Where mas eap S^; and so on. The total of the divisors = (m ~ DT = 3 x 440 = 13:20, 
test-colum efore denotes the number of groups (here four). On dividing the totals for the 
Shown in the as given in the initial table (Table IV), we obtain the eight saturation coefficients 

M Che. bottom line : thus 2:43 4 2-70 = -90, etc. 
tions ехсеј 5. The same two checks can be used as before : viz., (1) the sum of all the satura- 
block 5 2. those belonging to a given block should be identical with the divisor for that 

The ME residuals should add up to zero. Р 

On the mu ti ultiplication Method. As we have seen, the bifactor method depends essentially 
MY be supp Cation of sums rather than on the addition of sums. The working procedure 
qumarized as follows.* 
€ totals of the sums of the submatrices are found as in Table V, but without inserting 
Makes of ees for the diagonal cells : thus, 2:23 + 1:52 + -84 = 4:59. These totals are as à 
the ¢ the wor Even at the foot of the columns headed (a), (b), (c), and (d) in Table IV. But to 
divisors the B. quite clear, the sums have been copied out and totalled afresh in Table VI. То find 
» the totals so obtained are subtracted from half the grand total (6:86 — 4:59 = 2:27, etc.). 


TABLE VI. MULTIPLICATION METHOD 


Calculation of Divisors for Bifactor Analysis 


Group Sum of Submatrices Total 
© @ 6 © @ 
а 
(0) = 2-23 1-52 84 4:59 
(o 223 E 1-16 64 Ais 
me) 1-52 1-16 — 47 fine 
pal 84 64 47 — 
‘Visors 13-72 + 2 = 686 
4-59 403 315 1:95 ў 
2.83 3-71 491 1372 


D. 328-341. 
51 


Group Factor Analysis 


A second supplementary table (Table VID is now constructed from Table IV as follows. пре 
first figure in each row of Table VII consists of the total of all the sums in the corresponding row О 
Table IV, except the first sum (80 + -45 = 1-25); the next figure of the total of all the sums except 
the first and second ; and so on if there are more totals left to calculate. In the present case the 
second ‘total’ is simply the last figure in the corresponding row of Table IV, i.e., 45. It is con- 
venient to enter each of these ‘ diminishing sums > under the letter corresponding to the omitted sum. 


TABLE VII. TABLE VIII. 
TABLE OF DIMINISHING SUMS PRODUCTS FOR NUMERATOR 
Group| (а) (b) (o © Group (a) (b) (с) | S 
Test 1 — 1:25 45 Test 1 A 14750 3600 1:3330 
2 | = 1-11 -39 2 Z 1165 +2808. | 14499 
3 97 = 35 етар 2170 | 13810 
2 83 =. 29 a 59, -— 456 | 10] 
5 *:91 -26 — 5 .7644 1690 = 2227 
6| 2 21 - 6 | 486 107]  — 2 
7 67 28 = 83188:3350 += 1092 = 41971 
8 44 19 = в | 166 05 .— | 
ssively . 


_ These © diminishing sums’ have now to be multiplied by the figures that were Succe mired 
omitted, i.e., the first and second subtotals from Table IV (the last subtotal, for d, will not be red i 
because there is no entry gorresponaing to it in Table VII 3). We obtain the produ divi 
Table VIII. The products ih each row are then summed ; and the sums divided by the grouP ‘st (see 
already found in Table VI. The square root of the quotient gives the saturation for the 1e 
Table IX). It will be remarked that, although the * observed ’ correlations differ only very netho 
from the exact hypothetical values, as constructed from the saturation assumed, the bifactor shod '). 
does not reproduce those saturations so accurately as the group factor method (‘addition е 
Further, the checks аге no longer applicable. 


TABLE IX. CALCULATION OF SATURATIONS 


тҮ; ; rue 
Test Sum of Products Divisor 8 Saturations s 
(Numerator) (Denominator) Quotients (Square Root. Value 
. = -90 
1 1:8350 2:27 -8084 -8991 " 
2 1:4463 2:27 6371 -7982 E, 
3 1:3810 2:83 4880 -6986 -60 
4 1:0115 2:83 :3574 -5978 :50 
5 :9334 371 2516 -5015 д0 
б +5967 3-71 1609 4011 30 
1 4442 4-91 -0905 -3008 -20 
-1971 491 0401 2003 
ing 
: \ efer! 
Comparison of Alternative Methods. The theoretical grounds for Pi aditiog 


what I have called the addition method will, I hope, now be clear with t rocê 
method we are in effect basing our estimate of the Тыл coefficient олар of 
af averaging. In fact, the expression for the saturation coefficient, W h 
of calculation, might be regarded as a kind of a weighted average, the ©. 
as usual, the sum of the weights. This is the natural principle to adopt IP. nm» 


2 Тһе whole process is equi iplyi 
€ quivalent to multiplying each sum by the total of all t ў 
E ашо the products. The ‘ total of all the rest ’ Сай be obtained by С ures * 
е column total. Thus for the first row we should have the follow? um 450 
1 


(а) (b) z 
Sums (c) (d) 

e E 1:25 m. 1-6 5 2 
Products ... — 3 14750 5 iano Ре Li 0 36007 2 


52 


i 


mo 7? 


! 
| 
K 


"factor? 


CYRIL Burr 


out th : s 
The anata o the sampling fluctuations that must affect each observed coefficient. 
Е о арра ion formula, оп the other hand, is a direct algebraic deduction arm 
postulated S equation ; and the deduction is strictly applicable only when that 
the case tela иг ТШШ With an empirical table this can never be 
aro eae uently, the method is exposed to all the risks i 

Te implicitly multiplied instead of being added.) uda Is ы 


A few word 

that adopt rds are perhaps necessary to indicate how the procedures so far described di 
some ELS Dy Serna and his followers. There are three points of divergence а aer om 
any attempt E Gee e principal difference arises from the fact that Spearman always distrusted 
10 use “objective s ШЕ general factor on the basis of internal evidence. He therefore preferred 
is Would seem to b of * pure g,” and to eliminate g by the ordinary formula for partial correlation 
to eliminate is теоре а perfectly legitimate procedure, provided (a) that the general factor we wish 
general intelligence fiable with g as independently assessed (i.e., with what is ordinarily described as 
Otherwise it would ), and (b) that we possess a satisfactory set of tests for assessing * pure g.’ 
into account, " (i) аел wiser to take the information supplied from the entire battery of tests 
O deal with od ut further, Spearman himself thought it necessary, on methodological grounds. 
group factors АЧАДЫ moup factor ata time. Thus, in re-examining the evidence for the various 
might need to be d other investigators, instead of analysing an entire correlation table which 
ч! e, or part of a ИНДЕ Дор three or more group factors, he preferred to confine himself to a 
d Cross-correlatig le, which could be reduced to a 2X 2 form, consisting of inter-correlations 
pet гіп each of they etween two groups or tests only. And here, instead of looking for a group 
ef reference values? 9 groups, he insisted, as before, on treating one of the groups as consisting 
Ba Dossible group EEE solely on g, while the second group was examined for the presence 
te qumming Sentire à a ; (iii) Thirdly, Spearman objected to my attempt to derive group factors 
9 derive а general fact u men of * like" tests for the same reasons that he objected to my attempt 
we Verages ; and in his y summing the entire correlation table. In both cases factors were treated 
А "8 арріісађе oniyi is yew this was a mistake (6, pp. 61f.). Strictly speaking, his own formula 
ue ities * Were to be t e case in which there were two ‘reference abilities’ and in which two 
ntly, Some EAS SERE for specific correlation (6, p. 223, and Appendix, p. xxi). Conse- 
түри ication; as he says, “is introduced when the available abilities are more 
out every possible mice two can be used at the same time. . . . The more thorough 

metho Н АП му, 

Meanwhile, B. looking for a single group factor only I shall discuss more fully in a moment. 
о! onsider how the group factor method сап be adapted to the case where there 


nly 
Y two group factors, 


C. TWO GROUP FACTORS 


Fre 

0 "quenc Н У 

Ccurs in hice of Paired Group Factors. The need to deal with two group factors 
main types of investigation. 


Whi (0 It occurs + : 
Subst aS We Raa Sealing with those correlation tables 
ut Just seen, Spearman preferred to work. The procedure that he proposed to 


апа]ув. © 15 : 

Which p» all group cans satisfactory : few of his critics accepted his demand that, in every given 

For ex, е emp! oyed 015 but one should be dispensed with. Indeed, more often than not, the tests 
Ample, in de: о eliminate g seemed themselves to involve a second group factor of their own. 

: aling with Miss Rogers’ study of mathematical ability, he reduced the tests 


The; 
stime, аге oth 
Contare factor... 


that are reducible to a 2 x 2 form, with 


nt Or-me. Holzinger's regression equation for 
di he gr asurements (Preliminary Reports, No. 8, p. 7) includes only those tests that 
In th ; erences m n І have discussed this and 
Workin 9Deo'd Nor ore fully in (8). As stated above, full proofs of the relevant formule are given 
es already cited. But once again, I imagine, the reader will have no difficulty in 
pendix, I suggested 


f the notation of the Ap 5 
b,etc. The expressions 


cient to 


T s; Pto enter j ] 
Shor Posal (o. into the same set.” But in principle there seems very littl weer 
y “summation » or “averaging.” 


о » 
d, howeoo!” and the proposal to calculate factors b é 
her co ees that the fore which Spearman suggested for use when pooling was not 
+ even if his own assumptions are accepted (see this Journal, 1, pp. 96f.). 
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employed to two main groups : (a) ‘ arithmetical tests,” to be examined for a possible group fco 
and (b) ‘ reference values,’ to serve as tests of pure g. Now the latter included Analogies, Oppo: med 
Completion, and Thorndike's Test of Reading. All of them were verbal tests ; and it E his 
highly likely that they contained a verbal factor as well as ‘ pure =” The same was true O н 
discussion of Collar's research—a somewhat similar study of the arithmetical factor based on Inded 
in L.C.C. schools : here again it seemed clear that each of the two contrasted groups of tests inc 
its own specific group factor.” Jiest 
(ii) In the analysis of physical traits a similar situation often occurred. In е саа 
empirical table which seemed to call for an analysis in terms of group factors—namely, the cera 
Macdonell table for head and body measurements to which I have already referred—the Ted el 
division was clearly into two main clusters of ‘like traits The procedure which I very tenta B 
proposed drew sharp criticisms both from Spearman and from Pearson himself. In lateri opte 
with group factors, therefore, to avoid objections directed primarily against the technique a traits 
rather than against the conclusions drawn, we generally tried to arrange the battery of fes it was. 
so that there should, if possible, always be at least three group factors. In psychological MO BER of 
quite easy to secure this condition: indeed, in early work with mental tests, where the He factors 
persons tested was small, and the analysis could not pretend to isolate more than a coup! c T r han h 
in each test, this threefold grouping was the type most commonly encountered. On the ot Contrast 
in the field of physical measurement, the study of * body types '— particularly of the recurring i called 
between the leptomorphic (or asthenic) type and the eurymorphic (or pyknic) typ Y 
for analysis in terms of two group factors. found 
(iii) Later, when larger populations became available for mental testing, it was. опе 
that many of the group factors hitherto discovered were apt to get subdivided in their tu Dro 
the division was usually by pairs. Indeed, an extremely common type of factor-pattern Jr these 
to be that in which the whole set of traits involves two broad group factors, and then сас cceptable 
splits into two subfactors, and so on. Hence it seemed urgently necessary to devise an т coretica 
method which should at once bc convenient in practice and free from serious th 
objections. S 
. Now, even those investigators who have attempted to deal with several groun Y des 
in the same correlation table appear generally to assume that, unless à pater tor 
at least one test which involves no group factors at all but only the general pe « tree 
impossible to determine the factor-saturations with two groups only: ust а (it seems 
variables at least are always necessary for a Spearman ‘ two factor ’ analysis, ae Thus, 
to be supposed) three groups at least are always necessary for a group factor ana y wo grouP 
in illustrating how a set of seven tests can be factorized to give one general an st has 7° 
factors, Holzinger constructs his artificial correlation table so that the seventh te А j 
saturations for every factor except the first.? Actually, however, there are sever practice 
which the difficulty can be overcome. Here I shall content myself with what 1 
has proved to be the simplest and most convenient. 


e—constan 


a 


;,d the 
Example. In contributions previously published I have always applic’ of 
method to empirical figures obtained from actual researches. But for purr served 
illustration, the errors and discrepancies inevitably involved in dealing with all take 
coefficients tend to obscure the underlying principles. Неге, therefore, Т ylation 
an artificial table of correlations, constructed to give simple and exact cal 5 0 


A convenient set of figures, obtained by rounding off observed results 


* Rogers, A. (1923), * Tests of Mathematical Ability,’ Columbia Univ. Cont., NO- р? J. 

pp. 56-57; Collar, D. J. (1920), ‘A Statistical SU ey of Arithmetical Ability, Brit оеп 
XI, pp. 135f. Miss Rogers’ data seemed to me to afford admirable support for MY qi wass of 
Actually it included a third group of tests, mainly perceptual, which Spearman omitted. апау! 
possible to show that the verbal factor, which appeared with my ° two-group-factor factor yer 
the small table of correlatigns selected by Spearman, was identical with the vere e pl? gd 
appeared with a * three-group-factor analysis,’ when the whole of Miss Rogers table he factO™ pere 
With two group factors the chief objection urged by Spearman and others was that f 
their saturations could not be uniquely determined. But with three group factors, O over, ^j ifii 
could be no doubt about the uniqueness of the figures obtained. Spearman, Ох (cf. T 
maintained that, in any case, no such thing as a group factor for verbal ability existe 

of Man, р. 237). In later editions of his book he altered his view, and added this ft ice 
The reader who desires to try out the methods described in the remainder of this А 

in Miss Rogers’ figures an empirical table admirably adapted for the purpose. 


* Holzinger, H. Student Manual, p. 26. 
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educational abilities! is shown in Table X. The coefficients, as we shall see in a 
pament, have been built up by using one basic and two non-overlapping group 
actors ; the table therefore has a rank of three. Accordingly, by successive approxi- 
dpuom a computer, given nothing but the test-correlations, should be able to 
etermine exactly the appropriate values for the self-correlations (shown in Table X 
within brackets). 


TABLE X. OBSERVED CORRELATIONS 


Test 1 2 3 4 5 6 7 
1. Verb, Int 
rb. Int. TOME . 76 48 29 
2 ба ll I E (80 522 532 
4 Di CIE я -48 +52 (34) 21 
he ла ху 29 32 21 (13) 
-63 +56 ° +35 21 
45 40 25 15 
36 32 :20 12 


5 


— TABLE ХІ. FACTOR-SATURATIONS" 


A. Bipolar Analysis B. Group Factor Analysis 
Tests = 
Square- quare- 
I п ш quee dor PED 
l. Verb 
2. Sei, T (at 8824 0625 +1939 | 82:00 | 90 -10 — | -82 
3 еру (Gen) . | se] 13500 2.0596 | -8000 | -80 40 — | -80 
4. Dict, Mech.) ... 5471 — 41874 —-0747 | 3400 | -50 30 = | -34 
ESSO MM 3329 — 11249  —-0596 | 1300 | 30 20 — | 3 
m 
: vocat] o | 26191 26248 0000 — |250 10 — | — 
$ Mech Wb Int... | лоза — —4362 +0894 | 530 | 70 — 20 | 53 
аена ns e -5518 — —.3124 —0894 | -4100 | 50 ك‎ -40 | 4 
m (Praet) `; 490 —1562 0000 | -2000 | 40 — 20 :20 
m a 
Sq CEST 1:6762 —-6248 0000 | = |160 — б | — 
are. s. 
Contrib : 69 30 24 | 323 
ib, ‚е | 29003 2635 662 | 3-2300 | 26 
(ber cong, Уайапсе | 414 38 9 M61 |384 43 34 |461 


As i 
tna two poal We begin with a preliminary * bipolar analysis.” The saturations for tio genera 
IKN а cleg Qr factors, found in this way, are shown in Table XI A. Тһе ana ше 
S wi T line of division can be drawn between the first four tests and bx dee 
Th first Et Sarson-Macdonell table, the cross-correlations between the four г m denn 
та (бды and the three traits forming the second group fit a Баа а “they form a 
the WX of rang ations are shown as а separate table in Table ХП. Sie only, namely, 


Ta 2 
i. Pasic’? dun they can be regarded as due to a single common 


€ test. 
3 are described below on p. 62. 
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(i) Determining the Saturations for the Basic Factor.. We now proceed to analyse 
this rectangular submatrix in much the same way as we should analyse a complete square 
matrix of observed correlations, when extracting a single factor only. Since, Bo 
the submatrix is asymmetrical, the sums of the columns will no longer be identical with the 
sums of the rows. We must therefore consider how we can find a suitable divisor. 1° 
take the square root of the grand total, as we do with a symmetrical table, would, as а 
general rule, be quite indefensible. For one thing, there are four tests furnishing Colum 
sums, but only three furnishing row-sums ; and this fact of itself must make the column-sum 
smaller. Secondly, quite apart from the difference in the number of tests in either group, 
we have no right to assume that the factor-saturations in each group will be equally balance. : 
as we shall find in a moment, the average saturation for the first three tests 1s only , 
and for the last four 0-63. 4 И ٤ 

To allow for these two peculiarities we must therefore use two divisors instead of one 
For a first approximation the simplest procedure is to select the divisors so that im e 
group factor analysis the sum of the factor-saturations for the first three tests will ha E 
the same proportion to the sum of the factor-saturations for the last four tests as they 
in the bipolar analysis. ions 

Now let d, and d; be the two divisors required for calculating the group factor-saturatio E 
from the row- and column-sums ; г, and ta the sums of the group factor-saturations го Е 
corresponding groups ; f the sum of all the correlations in the submatrix ; and s, ап {е 
the sums of the factor-saturations obtained for the first and second groups of tests by 
preliminary bipolar analysis. Then 

t 
fı = Gp and ts mdi 
But = tıt, = did}. Hene d, = ts, and da = tı. 
51 


ty $1 $1 мз 
=з, We have d, = di; and t = did, = s, di А 


Непсе а= у [str] md Ау [st] 
Thus all we have to do is to multiply the ќоќа1-(/) by the proportio 


sums as given by the bipolar analysis (s/s. or s2/s,), and take the square root. 
inserting the appropriate figures from Tables XI A and XII, we find 


2-6191 

d, = {тете х 4-00} =:2:50, 
1:6762 

and d, — A sisi x 400} = 1-6. 


On dividing the row-sums by 2:50 and the column-sums by 1-60, we obtain the saturation’ 
shown on the extreme right and at the foot of Table XII. We can check the figures 210025 Ex 
same lines as before by noting that the sum of the saturations for tests 1 to 3, viZ., / == А 
d, ; and the sum for tests 4 to 7, namely, tg = 2:50 = dı. Further, 1:60 х 2:50 = 4-00. 


TABLE ХП. CALCULATION OF SATURATIONS FOR BASIC FACTOR (6) 


h d; 
Now, since = 
d, 


n of the two 
Accordingly 


Non-Verbal Verbal Tests i Row- Sats. 
Tests 1 2 3 4 Sums 

5 63 56 35 21 175 0 

6 45 40 25 : 15 1:25 40 
7 36 32 20 12 1:00 

1:60 

Column-Sums ... + | 144 1:28 80 48 400 22 
Sats. s 0 E :90 -80 ۰50 30 2:50 


Divisors ; for Column-Sums, 1:60; for Row-Sums, 2:50. 
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TABLE XII CALCULATION OF SATURATIONS FOR GROUP FACTORS (P) 
Кыл е _ ______ __ ._  — — A 1— à  .Á]Jeb 


Test 5 6 7 (for RENE 
Saturations (Basic Factor) ... “70 -50 “40 = 
EEE 
Residuals کا‎ СОЙ Eu ۰04 08 +04 16 
Soson c ОСИ ES 
Residuais a) ата ET 16 08 32 
кат ооо 9 
Residuals’... аз арра 04 08 04 16 
Totals of Residuals... an 16 32 16 ۰64 = :80* 
Saturations (Group Factor) ... :20 40 20 {30 

— 


Divisor for Totals = 4/:64 = '80. 


(ii) Determini 5 7 
to elimi ermining the Saturations for the Supplementary Group Factors. The next step is 
ay minate the effects of this р ог beste factor from the observed correlations. We 
saturations by reconstructing a theoretical hierarchy for the smallest group from the 
is 5 just obtained for tests 5 to 7, and deduct these theoretical values from the inter- 

i ns actually observed, as shown in Table XIII. The first set of residuals for these 
Sis are then factorized by simple summation in the usual way. We can then do the 


me wi UR 
Complet. the inter-correlations for the larger group, viz., that containing tests 1to4. The 


9 . " + n 
ace g A Qu hypothesis is that there are only two factors entering into the inter-correlations for 
0: 


Saturae еу suggest themselves by simple inspection. 1 er 
m S te Write out an algebraic SER for the sum of ШЕ residuals ipe O Hn ER 
Ces ated as unknowns, set this sum equal to zero, an so Y Be Ue rae Gel table 


ЖАША ап emni 
Square? such а ical table we cannot expect to find the secon : i 
5 s which will yield 


à " 

Work; е resi ic equation ٤ 
Уол Kin Tesiduals, in, it i It to construct algebraic eq tion 
ЧАН ew or obtain pu eamli oy set SHS inn (poe iy 

Slv Cessarily laborious. For most practical purposes 1 20 furations obtained with 
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the first trial in order to diminish both sets of second residuals so far as possible. Even 50, in the 
vast majority of cases, unless extremely large populations have been tested, the sampling errors 
will be big enough to render any elaborate adjustments of this kind superfluous. 

For a simple example of this procedure applied to an actual correlation table, the reader may 


refer to the analysis of the Pearson-Macdonell table quoted in the preceding issue of this Journal 
(12, p. 118, Table IX ; cf. also Table XV B below). But, as a rule, the matrix of non-overlapping 


group factors merely serves as an intermediate step towards obtaining the final matrix of overlapping 
group factors ; and consequently in most published articles only the latter is printed. 


D. ONE GROUP FACTOR 


For the sake of completeness I shall conclude this section by considering how the 
foregoing principles are to be applied to a correlation table in which there appears 
to be only a single group factor. Spearman, as we have seen, always doubted the 
legitimacy of trying to extract more than one group factor from the same table at 
the same time. When confronted with a table in which the possibility of severa 
group factors called for consideration (as in researches planned, not by Біте?» 
but by other investigators), he still preferred to take one group factor at à tune 
even if this meant discarding a large portion of the table. His method was to selec 
correlations so that these should, so far as possible, relate to two groups of tests 


only. One group was to be regarded as providing “ reference values forg": ber 
it was assumed, would involve no further factors. The other group would incu 5 
lemen 


or at any rate might include, in addition to the general factor, g, the supp. 
group factor, whose existence he proposed to examine (see above p. 53). 

Spearman's procedure, however, sought to investigate simply the presence or absence O! ate 
group factor (6, ch. ХШ). Even when he admitted its presence, he never went on to е5 mh 
its saturation coefficients. Indeed, he devised no formula, and suggested no method, for $ 
determinations. + 


1 The procedure which Spearman eventually substituted for group factor analysis is worth describir? 
in some detail. In the current text-books on factorial methods, his criterion is always descr re: 
but no explanation is given to show how he actually used it (cf. 10, pp. 68, 112f. ;. 11, рр: aps 
Thurstone, Multiple Factor Analysis, ch. XII). His own account (6, pp. 223 and xxi) is by nO M raph 
easy to follow. The fullest explanation of the method is to be found in Hargreaves’ “Monos an 
on * Тһе Faculty of Imagination’ (loc. cit. sup., 1927, pp. 18-24), where he describes in ders iof 
attempt to isolate a single group factor of * fluency ' from a correlation table including es tion 
* intelligence" and 6 tests of * imagination, to which tests of Memory, Speed, and Persevera 
were added later. 1 
As the investigation was carried out in L.C.C. schools, a preliminary conference was chers 
as usual, between the supervisor (Spearman), the L.C.C. psychologist (myself), and the head ions; 
concerned ; and the correspondence-file furnishes an instructive account of the alternative sugges Inn 
In discussing the statistical procedure, I had recommended “ an extended group actor me 5 
(virtually that described in Section II B above), on the grounds that, not only my own student dealt 
also “ Miss Carey, Bradford, Collar, and others had tried it with some success,” and that I «the 
most directly with the problem, as envisaged by teachers.” Spearman, however, replied t û [intet 
true criterion (the tetrad difference criterion) has now been provided’: “even the О T 
columnar] criterion has shown that the group factors, formerly claimed, have no real existence tors 
this is now confirmed by the true criterion." “ To estimate the size of the supposed gro fe. a 
merely by accidental departures from a calculated hierarchy is," he maintained, “very uns 
Every supposed group factor must be scrutinized by itself." « расої ү 
In his Monograph Hargreaves accordingly discusses each of the conceivable factors ог fart gence 
one by one. He begins by applying the tetrad difference criterion to the tests of Inte ips of 
and Imagination separately ; and so shows, or claims to show, that “each of the two 
tests—Intelligence and Imagination (‘ Fluency ’)—contains a general factor and no seco! 
or group factors." Then, applying the same criterion to the two groups “taken tog! 
states that “a group factor or second general factor is found." Thus, one of the two рУ 
factors previously referred to * must be compound." Rather summarily, he dismisses the po tating 
that the Intelligence tests might contain a second factor, ** because of evidence that is accu except 
against the existence of a group factor in intelligence tests ° (he does not specify this evic ences tests 
that on a later page he briefly refers to the possibility of a ‘ linguistic’ group factor 1n Hence ја 
similar to the ‘linguistic ability ’ I had reported, and rejects it as “ most improbable 7). le & 
his view the only conclusion remaining is that the initial correlation table contains 4 sing 
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TAB X 
LE XIV. CORRELATIONS BETWEEN PHYSICAL MEASUREMENTS 
One, Group Factor : Calculation of Basic Factor 


Em ОИНИ 
P Trait | 
Tm 1 2 3 4 | Sum | 5 6 7 |Sum 
. J | 
Dre c | CE2D 736 -759 197 | 3413 | | 
» t К Б | 36 E 3 Ё 
3. Ee ^07 228 (731) -661 -800 2:928 36 340 18 | 368 
eam 7 um "661 — (714)  -846 | 2:980 | -321 301 150 | 772 
= 800 -846 (803) | 3-246 | -289 305 135 | -729 
3 | 
Saturation — 7 3113 2-928 2-980 3246 |12-267 | 1318 1285 -674 |3:277 
TEES 376 367 -193 | -936 
$. Headp. “~ "363 E | 
miht. P 345 -321 289 | 1318| — : - = 
T Head в; 7 Ээ 340 301 305 | 1285| -395 eas “402 = 
aa 206 183 — 450 135 | 674| 618 42 = | — 
= «| 908 868 7722 729 |320| — = zx 
Saturation SE | 4021 3796 3753 3975 (15544) — — — — 
: 906 -855 -845 +896 | 3502) — = — — 
Divisor f e 15: | 
or traits 1-4 — 2 БУЕ 4:4381 Divisor for traits 5-7 — 3:502. 


\/12-267 


р Гой : 
Whi "strate 
hi the procedure I propose, we may take the Pearson-Macdonell 7 x 7 table, 


Was anal ; z 
0: + 
E ропа Saas edo a issue of this Journal in terms of two group factors super- 
Ws giring Whether we ic factor (see Table XIV). We may imagine a follower of Spearman 
bo у t assume that th cannot after all dispense with one of these two group factors. Can 
face and its limbs ere is just a single basic factor determining the general growth of the 
requis; et us iy Re à special group factor confined to the measurements of head and 
lte ore see what kind of a fit this hypothesis yields, by calculating the 


figures i 
according to the principle of simple summation.* 


The We begi 

fi Bin b: 08 

М Step is 0 determining the basic saturations for the four body-traits (as we may call them). 
e sums of the columns of inter- and cross-correlations for these four traits. 

for these, trial values are first inserted, 

f the self-correlations so found. 

the total of all four columns 

including the self-correlations 

+906, etc. (see 


К Sums must i i 

e ФП Corrected j include the self-correlations ® : 

(15:5, yo for each e Ordinary way by substituting the squares о 

fie. by y t e Brus co un of seven figures is found by dividing 

3st line 12:267 = 250005 of the sum of the inter-correlations, 
Table ). We thus obtain for the saturations 4:021 + 44381 = 


Uency,” а 
е { A 2 v 
> Entering into the tests of Imagination in addi 


S Were s 
t^ must pr 00 Obey the tetrad difference criterion, h 
what was later called a ‘ bifid factor” or * bifactor': i.e. as he says, the 


St еп : 5 
ter into the several tests “in exactly the same proportion. To clinch the 
] nature of the so-called fluency 


tion to the all-pervasive g. Since 
Songjactors in it is inferred that the ‘ compound 
Facto ion 


ti th i is not altogether 
Crit, fact is рг a lly admits that he is not altoge* 
са Поп {9 enough. Rie Where single group factors are to be separately 1 
P ut “ when several factors are likely to be operative, the [tetrad difference] 
of more than one factor, but 


"s formula, 
ore than one factor has 


Sat, tis Pro o 
Urat? Yuria, Cedure i i 1 to 1-000 : 
ion. 471 involves i -correlations are all equal t Я 
ns the assumption that the self-cor! re cific overlap, the 


D.96. When the cross-correlations incl 
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To find the basic factor saturations for the last three traits, we first sum the three 4-figure columns 
of cross-correlations. For these the divisor is the sum of the four saturations just found (3-502). 
We thus obtain 1-318 — 3-502 = -376, etc. The usual checks can be applied: for example, the 
residuals of each of the four 7-figure columns and of each of the three 4-figure columns should 
evidently add to zero. 

The saturations for the single group factor are obtained by subtracting the theoretical values for 
the inter-correlations between the three head traits (calculated by multiplying the three saturations 
just found) from the observed inter-correlations. Since for these traits there are only three observed 
inter-correlations, and therefore only three residuals, an exact fit can here be obtained. The | 


saturations are computed by the familiar formula rss = м Гъв "37 etc., where rso, az Рот denote 
"o7 


the three residuals : e.g., for trait 5 we have = ADS E V 4229 = -650, etc. 

The factor-saturations so found are shown in Table XV. For comparison I give the 
figures obtained on the assumption that there are two group factors (from 12, p. 118, Table 
IX). The question now arises: which of the two alternative assumptions furnishes the 
better fit? 

Judged by the absolute amount of discrepancy, the hypothesis of two group factors 
certainly yields the closest fit. The numerical total of the residuals comes to 0:257 with two 
group factors, and 0-502—nearly twice as much—with one group factor. Judged according 
to the principle of least squares, the difference is still more marked ; the square-sum 1n the 
former case is 0-0045, in the latter 0:0218. 


4 TABLE ‘KV. FACTOR-SATURATIONS 


A. One Group Factor | B. Two Group Factors 


Trait 
Size Head Size Body Head 
ПНО: 906 — 719 -501 — 
2i Height, л, cese 854 — -688 464 — 
3. Finger 205 eps ۰845 — :612 +582 — 
4. Forearm... 200 -896 — +578 816 = 
5. Face B. | :376 «650 | -508 — -505 
6. Head L. | 367 -395 495 = :284 x 
TaB re д) 193 839 260 — :962 
Square-Sum «e | 3:379 1-282 2:270 1-471 1-261 i 
Contribution to Variance 48:3 18:3 32:4 21:0 18:0 
(per cent.) 3 
277 7а Mice dA PEN NEN eee ee { 
f 


But are these differences significant? As the observed correlations were calculated 
by Pearson’s tetrachoric procedure, it is not easy to assign exact figures for the stan 015. 
errors of the residuals. Macdonell's estimate puts them in the neighbourhood О à 
Roughly speaking, therefore, any residual that is numerically larger than 0-03 
regarded as significant. With two group factors no residual reaches this figure. м0“ 
group factor seven out of 21 residuals exceed it. The standard deviation with th? the 
group factor hypothesis is 0:0147 (just about what we should expect by chance) ; i fi 
one-group factor hypothesis it is more than twice as large, namely, 0:0312. le 
In point of fact, a preliminary inspection of the table of observed correlations might have irix 
us to anticipate some such result. I have arranged the traits so that the rectangular suber as 


of cross-correlations (i.e., the correlations of traits 1-4 with trai form Jf 

i i n d its 5-7) should conform, 
possible, to a hierarchical order. It will be noted that they do so D remarkable closeness: for \ 
traits 1-4 were also due to the same general factor and nothing more, the inter-correlatio 5 der: ў. 


these traits should also be hierarchical, and (what is more important) should exhibit the Sa orar 


But this is not the case. The trait which has the /argest sum for its inter-correlatioDs 
has the smallest sum for its cross-correlations. 
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If our critic suggests that erhaps after all we might have been wiser to take, not the three head 
measurements, biki four body measurements as containing the sole additional group factor, 


then his suggestion would be still more strongly contradicted by the same argument : for, as simple 
inspection will show, Head Breadth has the /argest sum for its inter-correlations, and the smallest 


ae for its cross-correlations. Thus, both sets of inter-correlations demand a group factor of 
their own. 


In practice I am in ined to believe that, with correlation tables which are expected to reveal 
only a Single КТӨН n e investigator will, as a rule, be better advised to start БУ allowing 
or two group factors, and then prove that the saturations for one are non-significant. n ша Б 

Owever, I should regard an experiment which yields a correlation table showing one, group actor 
only (or indeed no more than two group factors) as badly planned. Unless the number o рео. 
€sted or measured is exceptionally large, and the standard errors in consequence exceptionally 
mall (as they are here), it is usually possible to fit such tables with so many different group factor 
Patterns that the results are bound to be inconclusive." 


III. OVERLAPPING GROUP FACTORS 


Rotation. Wh I r of persons tested is large, and especially when 
the assortment of tests p Roch seen fulfil a practical need rather than to solve 
à theoretical Problem, a pattern of non-overlapping factors is likely to give a some- 
What incomplete fit to the correlations observed : in such investigations, as experience 
Shows, we nearly always discover residuals indicative of overlap 2 in the outlying 
submatrices, But in any case we cannot really assume that an empirically derived 
“ctor matrix must always contain a number of saturations that are exactly zero. 

ence it is desirable to have ready for use some simple arithmetical procedure which 
will enable us to readjust the schematic pattern of:non-overlapping factors so as to 
yield а closer overall fit. © i | 
ob For this purpose the most obvious device is to rotate the bipolar factor matrix 
tained Y simple or weighted summation to fit the factor-pattern obtained by the 
group factor analysis.? The use of orthogonal transformations is, of course, a familiar 
Procedure in mathematics ; and the idea of arithmetically rotating factors in this 
fre was first Suggested by Garnett (5). Graphical methods of rotation have ues 
stit у used by Thurstone ; but, as Reyburn and others have pointed out, the up 
allo on ОЁ graphical for numerical methods is not only less exact, but appears to 

V wide room for subjective influences (13). 


Example, To i i 1 tary procedure and a 
à . То illustrate the need for this supplemen ary p ure 
of wenient method of carrying out the calculations let us consider the artificial table 
este lations Set out on p. 63. It may be regarded as based on a set of 10 cognitive 
“pplied to a sample of 500 cases (Table XVI). 


idance igi i igati i i ‘ational and vocational 
Buidanc ihe 4 riginal investigation, planned as part of an experiment in education 


n > a series of written, oral, and practical tests was applied to several batches of boys, mostly 


А A Г а е; 2 
fro; г this Teason that I have not illustrated my arguments in this section by an exemple Giana 
in which Jl measurement, Earlier discussions (e.g., 6, pp. 224-241) provide nu 


i is too large 
to m One group factor only is discussed. But in nearly every case the probable error is too lar 
tend b {ШУ clear comparisons profitable. Further, in many cases the уласна М 
Наг emselves to introduce a misleading parallelism into the coefficients. ] 


SU Brea vec» Н b -92 ;. and 
this of pues, table (cited above) the reliability of the tests employed varies from 0:23 to Merton 
SEE fends to produce an appearance of consistent proportionality between th 
B © cross-correlations i ional 
е . : а 
traits, Tistence of ‘ overlap between group factors was first noted in the d E YER 
(4, Dp. 59 P'Oved to be a regular feature in analysing correlations bereen ean had described 
us 9 or mores +, Spearman uses the term in a somewhat diten ed ا‎ OE EDENE 
a Cir respective бу z voting Te factor he preferred to say that they = 
X ў с factors (6, р. 82). < given in Brit. J. Educ. Psych., IX, > 
MES ? 9, р. 308. A more fully worked example is given in Brit. J. 


e! i ctor of the 
Log Inve n (amongst others) was described in a Report to p» Сеара Des 
epartment on Examinations for Junior County Scholarship. 


“С. Б Stigatio, 
ps Its 
to the Council for permission to make use of the results. 
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between the ages of 123 and 141, and attending trade schools in or near London. | The vie 
tests consisted of two tests of Intelligence, verbal and non-verbal, two papers on Arithmetica 
Problems ’ and ‘Practical Geometry’ respectively, and finally two papers on * Science" requiring 
brief written answers mostly in essay form—the first, a * general ' paper, dealing mainly with elem 
tary physics and including interpretations of diagrams, the second, dealing more specifically wit 
‘Mechanics’ and involving some appreciation of formule. These written tests were со 
543 boys, usually as part of a terminal examination at the institute they were attending. Md 
further tests, Dictation and Mental Arithmetic, were given orally, but required written Tende 
they were given to the same boys, with the exception of two smaller schools. Two practical oe 
were included in the scheme—an Assembly test and a test of Practical Science (physics, mechans 
and the like); but these could only be given to a limited number of pupils, amounting in а 
to 457. 


The correlations were calculated for each group separately ; and the average correlations 
(weighted for numbers and reliability) were factorized by the method to be described in a mons 
To obtain a more convenient table for purposes of illustration, I have here rounded off the ваш а 
Coefficients to a single significant figure ; and have then reconstructed an artificial table of * observ UN 
correlations (Table XVI), which, as a matter of fact, closely resembles the table obtained from (d 
largest group. Ву taking an example based on an exact set of saturations, we shall have a defin 
check for the figures ultimately reached by the procedure proposed. 


Bipolar Analysis. We begin with a preliminary bipolar analysis. Four factors nd 
been extracted, three of which we may regard as statistically significant. The saturatio E 
so obtained are shown in Table XVII A. The interpretation of the three significant faerie 
is quite obvious. The first is a factor of * general ability '—a weighted average of al 0); 
tests ; the second divides the tests into non-numerical (tests 1—7) and numerical (tests 8-1 T 
the third divides the non-numerical tests into verbal (tests 1-4) and non-verbal, i.e., praci Si 
(tests 5-7). And the factur pattern as a whole suggests that the correlations could proba Up 
be fitted by means of a comprehensive basic factor supplemented by three smaller gro' us 
factors. However, in the lateral submatrices of Table XVI several of the coefficients E 
much too large for a perfect hierarchical arrangement. Hence we may anticipate 50 
degree of overlapping between the several group factors. 


Analysis into Non-Overlapping Group Factors. As before, we may take the Број 
analysis as indicating how the correlation table should be rearranged and where the een 
of division between the several groups should fall. The rearrangement has already a 
carried out in printing the correlations in Table XVI ; and the table itself has been D 
titioned accordingly. The next step is to factorize it in terms of three non-overlapP 
group factors in order to discover the general frame-work. 


e 
An experienced computer would probably realize at once that a satisfactory fit could only bE 

obtained by successive approximation, and would probably start by reducing the more conspi rouP 
enlargements due to the overlapping at the outset. Alternatively he might seek to base the P ical 
factor analysis solely on those parts of the lateral submatrices that conform fairly well to hierar here 
requirements. The beginner will prefer a simpler and more automatic procedure ; Observed 
therefore I shall assume that the method described above (p. 48) is to be applied to the o 
correlations taken just as they stand. 

n 
.. Ihe results of factorizing the table by the method described in Section II A are E 
in Table XVII B. This second factor-pattern gets rid of negative. saturations ; but It is 
festly will not fit the table of observed correlations as well as the bipolar matrix. tions 
easy to see, for example, without actually reconstructing a theoretical table of corre И 
from the factor-saturations, that the residuals for tests 2 and 3 with 10 and for tests 2 factor 
with 6 will be fully significant. What we now require therefore is a readjusted grouP 


н ] . the 
matrix derived from the previous bipolar matrix (Table XVII A), but conforming uo 
Schematic pattern exhibited by Table XVII B. © 
а ап: 
We could, of course, use these two tables to calculate a transformation matrix : if thi г 


t 
formation matrix proved to be strictly orthogonal, then the fit obtained by the rotated fac old 
would be as good as that obtained direct from Table XVII A itself, and the new factors р nd 
themselves be orthogonal. However, the correlation tables reconstructed from the 810 trans" 
bipolar factors (sections A and B) would be by no means identical ; and consequently the Tation 
formation matrix would be far from orthogonal. We might go on to modify the transfor 
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matrix, so as to make it orthogonal. But this procedure would, as а rule, be rather laborious ; and 
in any case is suited only to more experienced computers. 


Calculation of Rotation Matrix from Condensed Factor Matrices. If, however, instead 
of taking the initial table of observed correlations, we proceed to reconstruct a new сом 
lation table from Table XVII B as it stands, and if we then subject this reconsttug 
table to a bipolar analysis, we should have two factor matrices, bipolar and group, whic! 
furnished exactly the same set of correlations. The rotation matrix for transforming the 
first factor matrix into the second would then be perfectly orthogonal. Since, however, 
we are concerned with factor matrices containing 4 columns only, the order of the rola 
matrix will be 4 x 4; and consequently, to calculate such a rotation matrix, We do 26 
need group- and bipolar-matrices containing 10 rows ; 4 will be sufficient. Let us there on 
condense the group factor matrix into a 4 x 4 matrix; from this let us reconstruc К 
symmetrical product-sum matrix, by the procedure regularly used for reconstructing Al 
theoretical correlation table. The 4 x 4 matrix so obtained will then, as a rule, be idente 
with the table we should get if we condensed the reconstructed 10 x 10 correlation ta 
to a 4 x 4 table of summed correlations. af 

Accordingly, with the present investigation we can begin by splitting the first group 

- 4 tests into two subgroups, the first containing test 1 only and the second tests 2, 3, an it t 
With the other three groups we can take the sums as they stand. We shall then work S 
the sums of the factor-saturations for these four groups (see Tables XVII B and күш Бе 
The 4 x 4 matrix of summed correlations constructed from these sums 15 shown 1n ed 
XVIII B. This is now subjected to a bipolar analysis ; and the saturations 50 Mu. 
are given in Table XVIII C. These should be identical with the sums of the bipolar, fac 
saturations we should have extracted from the complete 10 x 10 correlation matrix." ting 

It is now an easy måtter to calculate the transformation matrix, required for rota ine 
the figures in Table XVIII A to those in Table XVIII C. There is no need to дегет 
an inverse. The fact that the sums of the non-overlapping group factors include a nae 
of zeros renders a direct calculation, by the ordinary method of solving linear equation 
perfectly simple (for the method of calculation see Appendix, p. 75). 

The transformation matrix thus obtained is given in Table XIX 
easily satisfy himself that it is orthogonal. 


Rotation of Original Factors. (a) First Approximation. We now vil A): 
transformation matrix to postmultiply the original bipolar-matrix (Table X 
This yields the set of overlapping group factors as shown in Table XX A. been 
pattern of saturations is fairly satisfactory ; and, if the number tested had 
small, it could be accepted as it stands. 
It reveals several clear instances of positive overlap, e.g., tests 7 and 10 in the sero (for test 


(V), test 9 in the third factor (P). But it also yields three negative saturations over 01 ble po 
3,8, and 10). Such negative values are apt to occur in sections where there is an apprecia 


A. The reader can 


*« analysis 
"It has, as a matter of fact, been used in several research-theses (e.g., Miss Stevenson 5 апау 
of tests applied in the Navy during the war). The orthogonalizing process is best carrie figure? 
follows. The general principle will be to restrict the largest adjustments to those rows To begin 
of the transformation which will multiply the smallest columns in the bipolar matrix., the first 
with, we should leave unchanged the first row in the transformation matrix which multiplies 50 
column of the bipolar matrix ; we should alter the least important figure in the secon б еа! 
that the product-sum of the first and second rows is zero; we should then modify the "apd so 
important figures in the third row so that its product-sum with the first two rows is Zero > ke the 
on. Finally, for the last and least important row of all we substitute figures which will m? 
sums of the squares in each column equal to unity. ve 10 
2 As in the ‘sum method ’ of factor analysis (this Journal, II, рр. 62f.). Certain cautions Horiz 
be exercised in regard to the reversal of signs. The beginner is advised in his first efforts t ing G 


o 
Sn 4 ling 
the complete 10 x 10 artificial correlation table as a check : with certain modes of рое slig or 
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overlap in the final saturations, and where the provisional non-overlapping matrix gives oe 
saturations: the negative saturations are then arithmetically necessary to balance the POS US 
saturation that represents the overlap. By taking the correlations in the lateral submatrices SE 
they stand, we have in effect treated the augmented coefficients as non-significant results of samp ШЕ 
fluctuation. That means that our saturations represent averages, and the residuals represent Re ni 
significant deviations, positive and negative, about such averages. At this point, therefore, hich 
desirable to make a rough test of the significance of the larger positive and negative saturations v 
take the place of the previous zero saturations, and, if they approach the size of the probable error, 
proceed to a further approximation. 7 d 
(b) Second Approximation. There are various ways in which an їрє 
rotation can be effected. Two may be described as most convenient for gene 
purposes. р 
53 a : à А le 
(i) The rotated matrix just obtained reveals where the overlaps occur and what is their proban 
size. We can therefore go back to the original correlation matrix, and deduct from the aE 
coefficients the amount apparently due to the overlap. The analysis into non-overlapping Ed 
factors can then be repeated with the smoothed correlation table ; and the whole of the i seq 
calculations carried out afresh. This is probably the simplest routine for the beginner. 


(ii) An alternative method is to modify two or three figures in the rotation matrix, во 250 
suppress the larger negative saturations, and then orthogonalize the columns afresh. | figures 
negative saturations in the second and the third factors are obviously due to the large negative EAS) 
in the second and third columns of the rotation matrix (— ‘8379 and — +8598) : these chal the 
readily be reduced. Minor adjustments must then be made in other figures to ensure olumns 
product sums of the different pairs of columns are exactly zero ; and each of the readjusted ci 
must then be normalized to ensure that the sum of its squares is 1:000. n 

Final Results. An adjusted rotation matrix obtained in this way ? is sho 
Table XIX B. This is used, аз before, to postmultiply the original bipolar mar" 
(Table XVII A). The resulting factor saturations are shown in Table XX B. n the 
comparison I append in the last four columns the factor-saturations from wu ДО 
original correlation matrix was actually constructed. It will be observed E. 
agreement is remarkably close. With three minor exceptions the discrepa an 
between the calculated values and.the true values are all less than + 0:015 ; 


В = с д К sire. 
of course, with a further series of approximations we could get as close as We de 


‚э factor 
The interpretation of the factors is fairly obvious : the first is a * basic io 

of * general ability? ; the rest represent * special abilities’ for verbal, practica all 

numerical work. The instances of overlap are perfectly intelligible, when We г 

the nature of the tests. 


Constancy of Group Factor Matrices. In any field of enquiry where t 
gators may wish to assign a concrete meaning to each of the factors extracte rs 
desirable to determine the factors in such a way that their interpretation shall a pich 
the same, even when they are calculated afresh from a modified battery, to ДЖ 
certain tests have been added, or from which certain tests have been omite: this 
previous accounts of the group factor method, I have claimed that it will fu 


invest 
he шиа 


1 As noted above, the more experienced computer would probably have observed at the very 


which of the coefficients disturbing the hierarchical arrangement showed significant augmen ` Even 
and would have made tentative deductions before undertaking the first group factor analysis. е! ated 
with a brief inspection of Table XVI, it was fairly obvious that test 10, for example, shows ехабе aller 
correlations with tests 2, 3, and 4. When these coefficients have been reduced, severa 
augmentations are readily detected (e.g., tests 2 and 8 with tests 5, 6, and 7). F. R; 
? This and the remaining calculations were made for me by a former research student, Mi rigint! 
Llewellyn, who knew nothing of the true values of the saturation coefficients or of how t fee 
correlation table had been constructed. Т felt that, were I myself to make the requisite АЧ value 
I might be consciously or unconsciously influenced by my previous knowledge of the ID сй thé 
—or at least that it might be supposed that some such influence had operated. Having As rtifici 
figures shown in Table XVII B, Mr. Llewellyn began to suspect that he was working ОЛ Bh carte 
problem. In the hope of suppressing the negative saturation for test 6, he spontaneous, value 
out a further approximation. Working with six decimal figures he then obtained the r 

to within five decimal places (including the negative saturation). 
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requirement within the limits of the probable error. May I therefore conclude my 

description by a brief illustration of this point ? 

т Let us take the last correlation table (Table XVI), and omit the final group of tests, i.e., the 

quumerical * tests, 8, 9, and 10. The remaining 7 x 7 table can now be factorized by the method 
ei cdbed in Section C for the case of two group factors only. The saturations obtained first by 

рге Iminary bipolar analysis and then the final analysis in terms of non-overlapping group factors 

are set out in Table XXI B. P 


TABLE XXI. FACTOR ANALYSIS OF CURTAILED BATTERY 


| 
Test | A. Bipolar Factors | B. Group Factors 
ests | 
| 1 I Img muc V P 
1. Verb. Int | | Е 
Ла. -874 :076 :225 | 881 121 — 
$ Sci. 1 (беп) | 895 138 —144 881 219 = 
4, Dig? (Mech) | +542 21] —042 -489 345 = 
5, No . A | :330 ۰140 —:039 :294 :223 = 
6 Men Verb. Int. ... | -708 —:139 -098 "704 = 178 
T lech. Ass, ... SEE, | :542 25:354 — 029 | -484 x 451 
* Sci. 3 (Pract) | 447 — 072 — 2069 | 48 = 093 


e | 
On compari ; ve fe E 2 TRAN 
ment j paring the figures reached with those given in Table XX B, it is obvious that the agree- 
could exceedingly close. In only one instance (test 2) does the discrepancy exceed 0:100. We 
Significa We wished, attempt a further approximation ; or, if the small residuals were statistically 
zeros, ts Seek a slightly closer fit by a rotation which would substitute small saturations for the 


he initial Correlation table, as we know, includes several instances of overlapping. It is this 


Overl 
APPing that has caused the small discrepancies. With only two group factors we can scarcely 


Expect t, 4 
b 9 detect or to al it with a faccuracy. The ideal procedure would be, as 
efore. allow for it with any degree o! y. RE oad Tbe ELE 


ical vok inci Iculations so ent 

Dra € the principle of least squares. The calculati 7 у 

c iere pur exceedingly Eh ; and a general the results would differ but little from those 
n he simpler escri above. 3 

factor ith artificial воена whatever, the saturations reached by the group 

added | Nalysis would necessarily be identical throughout, no matter how many groups of tests we 

we dropp removed (provided, of course, at least two groups remained), and no matter what tests 

Will be Pped from the several groups (provided, of course, at least two tests were left in each). S 

In point POR if the reader turns back to Section II C, and considers the analysis of Table ZX, whia 

Overlappi; fact was constructed from the group factor saturations shown in Table XX C Ve ue 

constant dE iliminated (cf. Table XI B). In such cases therefore the figures would remain perfectly 

ariant. 


Si f Р ; PLA 
many Ple Structure. Thurstone's conception of a unique ‘simple structure “1s In 


nis imi i i ications had described 
as a ‹ „JS closely similar to what some of us in earlier publications had : 
Condi group factor pattern.’ A ‘ simple structure ’ should, we are told, satisfy p 
Shou] Brut р denotes the factor matrix of r columns, then * (1) each d о 4 
least. „Ae at least one zero [saturation] ; (2) each column of V should have a 


Entries 20105 ; (3 à s there should be at least r traits whose 
Ties (3) for every pair of columns th ere ttl be observed that the second 


Of thes nish in on h 
e In one column but not the other. V : cs 
Such E Conditions precludes any ‘ general’ or ' basic ' factor, since by definition 
9r must have a non-zero saturation for every trait in its column. 
rations of the 


from differences in 


Tried our APalysis b i B ; , ily orthogon 
0 a t necessarily orthog' 
3 mew Eu not y a series of ‘ rotations’ (no sor’ 


е; arithmetic raphical procedure li termine the 
chont linse + Nor is pest oe thee of the sign-pattern of the centroid mati nt on the 
Next © Of the 5 Of the several group factors. Their limits ee ect dolet of variables for the 


t rotation computer at each stage, as he selects this or t 
1 6 
Vec,, 
Or, 
5 9f Ming (1935), р. 156. 
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Both the similarities and the differences can readily be observed in the concrete examples where 
both methods of analysis have been applied to the same correlation table, e.g., where Professor 
Thurstone has re-analysed a set of traits, previously factorized by the group factor method, and 
where a set of traits previously factorized in terms of Thurstone's simple structure has been re-analysed 
by the group factor method." Wherever a strict orthogonal rotation would inevitably yield a general 
factor, Professor Thurstone prefers to analyse the correlation matrix in terms of an * oblique * simple 
structure, i.e., in terms of correlated factors; and then (as he has suggested in his more recent 
contributions) a further analysis can be undertaken, if we wish, to extract a set of * second order 
factors, one of which will be the general factor. 

Thus with Thurstone's procedure three consecutive analyses are required to do what can 
frequently be done by a single analysis, if we seek a simple group factor pattern.^ In more complex 
cases, where there is considerable overlapping, there seems no reason to insist on conditions 1 and 3. 
Thus, instead of assigning the zero (or non-significant) saturations solely to satisfy certain а priori 
requirements, expressed arbitrarily for all cases in terms of the empirical number of factors (a number 
which is largely an accidental result of the size of the sample), my own method is to decide their loca- 
tion by the aid of the preliminary bipolar pattern. That pattern, of course, is in turn really deter- 
mined by the explicit or implicit hypothesis, which has guided the investigator in choosing hls 


particular sets of tests or traits at the very outset of his experiment (provided that his hypothesis 


is applicable to the data so procured). 

There are other less obvious differences depending partly on our mathematical and partly on 
our psychological preconceptions. But these would require discussion at some length, and mus 
therefore be deferred to a later paper. Here I have said enough to show that there are clear be 
ferences both in method and in results. May I add that I do not wish to deny that there may. ^ 
problems for which Thurstone's procedure is the best, just as there may be other problems for whic! 
the bifactor method may be more appropriate ? 

In conclusion let me repeat that this article has been primarily concerned 
working methods only. There are a number of theoretical issues (e.g., the precise re 
the bipolar classification and the grou» factor classification, which is by no means simpl 
of the smaller bipolar facto;s, the criteria for choosing between a bipolar and a group fac 
thesis, the method of planning experiments for group factor analysis, etc.) which I have n 
to touch on here. 


with practical 
lation between 
e in the case 
tor hypo- 
o space 


Advantages of the Group Factor Method. In my previous discussions of various 
factorial procedures I have suggested that the factors extracted should, if poss! 7 
fulfil three main conditions: paucity, unequivocality, and stability. From on 
survey of the group factor method, it may, I think, be fairly claimed that its resu "E 
will in general satisfy these requirements. First, the values given for the group factor. 
saturations will be (within the limits imposed by the sampling error) either positiven 
zero ; hence each of the traits or tests to be factorized will be accounted for in tete 
of the minimum number of uncorrelated factors. Secondly, each factor will а 
(within the limits of sampling) uniquely determined. Thirdly, the factors will rema! 
stable, if not absolutely invariant, even when the battery of test 
is modified, e.g., when à comparatively small battery is enlarged by the а the 
of more tests or more groups of tests, or when a large battery is curtailed Ro 
omission of tests. Occasionally, it is true, the results obtained may fail to prese - 
all three conditions. But such cases are, I believe, exceptional. They Occ an 
my experience, only where the investigator has selected his battery unwise y jeld 
where, as a consequence, no factorial procedure could really be expecte to Y 
satisfactory results. 

4 fora 
1In Psychometrika, ХІ (1946), p. 20, Professor Thurstone presents an alternative analysis thas 
table of physical measurements. In Brit. J. Educ. Psych., IX, 1939, pp. 270-276, Dr. EYS 
carried out a group factor analysis of the correlation table given by Professor Thurstone in 
on Primary Abilities. 


his 
anal siS 
2 [n the second edition of his book (11, p. 297) Professor Thomson gives a ‘ second order anger 
of а 7-variable version of Thurstone’s box problem, and shows how it leads to à gener the ol 
containing a ' general factor plus an orthogonal simple structure. " But this js simp! У facto 
basic-plus-group-factor pattern, reached by a more circuitous route. Reached by the gro! Brite *` 
method the pattern of saturations is very much the same, but the fit is closer (cf. Bur Ж 
Educ. Psych., table on p. 47). 
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IV. SUMMARY AND CONCLUSIONS 


l. Many tables of correlations, covariances, or product-sums, particularly in 
Psychology, can be expressed in terms of a simple and intelligible factor-pattern, 
i the observed coefficients are treated as the result of (i) a small number of positive 
(8) aub. factors each entering into a few of the variables only, superposed upon 

кар ше basic" factor entering into all the variables. The aim of this article 
(i n to make a comprehensive and systematic survey of the working methods 

DE. Proved to be most effective for this purpose. E 
qu By definition the significant saturations for both the * basic’ and the * group * 
cone will have positive values only. Two modes of approach are available for 
ЖЕ ing this result -an addition method and a multiplication method. „Reasons 
суеп for preferring the former. With that method the essential principle is to 

sense the matrix of observed correlations by simple summation. This yields a 

‘ola mode of analysis which is both easy and speedy, and in the simplest cases 
es an adequate set of factors after a single analysis. a 1 

begin KA here there is any doubt about the grouping of the tests, it is advisable to 

a ae ith a Preliminary bipolar analysis. The pattern of signs so obtained supplies 

correlation indication for the appropriate partitioning of the table of observed 


4. Slightly different working procedures are described for the cases of one, 


tw 
is Mere, and more than three group factors. The simplest and commonest case 
general three group factors. The hypothesis of subdivided factors, however, will 


ere j Y require the determination of paired group factors * and with larger tables 
S no fixed limit to the number of factors that can be determined. - 

group f, here the sample is small, it is usually sufficient to seek non-overlapping 
for oy Doors Where the sample is large, it will generally be necessary to allow 
less S арріпе. This can best be done by an arithmetical rotation, which appears 
can be 2 10 Subjective influences than graphical rotation. A suitable rotation m 
then У Computed by condensing the group factor matrix by Sa oora 

these sums. a bipolar analysis of the product-moment table reconstructed from 
ely determined, econ- 


GNE: i 
Omica] ; Py, it is ; : ound are uniqu ‹ 
mical in BUD ê argued that the factors so fi q T red or disais 


er, and virtually invariant when the battery is en 
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APPENDIX 
I. Basic Formule 


Postulates and Definitions. Let x; and y: be the observed measurements of any given individual, 
i, for any pair of correlated traits, e.g., his marks as obtained from two mental tests. We shal 
assume that such measurements can always be expressed as linear functions (weighted sums) of a 
set of uncorrelated factors. Accordingly, let us write 


Xi = Wabi + Wati +... + Was, ° А 6 Я ‘ o 
Yi = Wabi + Wyeci +... + Wali, + а i : . 6G) 


where bi, ci, si, and t; are i's hypothetical measurements for the several factors and remain constant 
for the same individual in all the different tests, and where Wz», Wre, Wyb, Wye, etc., are the weights 
or * factor-loadings ’ for the several tests, and remain constant for the same test in all the differen! 
individuals, i.e., they denote the proportionate amounts of the factors 5, c, etc., in the tests X and у. 

Evidently the weights may take positive, negative, or zero values. A factor which has positive 
weights for some of the tests and negative for the remainder will be called a bipolar factor. A NER 
which has positive weights for some of the tests and zero values for the remainder will be calle a 
group factor. A factor which has positive weights for every test will be called a basic factor 1 ۹ 
(or practically all) the weights for the other factors are positive, but smaller : otherwise (e.g; 1l 
there are bipolar factors) it will be called a general factor. A factor which has zero weights 1948, 
except one particular test will be called a specific or * unique ' factor : when the tests are not repeatec, 
it will include the error factor. implified 

The measurements will be rendered comparable, and the ensuing formule greatly simp! t 


if we suppose the x, y; b, c, . . . s, and f are all expressed in unitary standard measure : QUEE 
Zx*—Zy—...—1; and therefore w + wre +... + we = 1 ; and similarly for the welg 
in y. 


With these assumptions the following theorems can readily be deduced. 
(i) Saturation Coefficients. On substituting for x from equation (i), we have— 


A [OA fe М Ы 0, 3 . = (iil) 
Fz = VE? VED 7 = (ab + wee +... + Wwas)b = wa, | 
51 = 


since Z cb, ... Zi sb, will be zero (because by hypothesis the factors are uncorrelated), and ons 
Similarly, rz. = Wze ; ry» = Wyb ; rye = Wwe. Thus, the correlation between a given test and апу the 
of its component factors turns out to be identical with the weight assigned to that factor s {he 
theoretical construction of the test. (Spearman’s statement that the weight is proportiona se wi 
square of the correlation between the test and the factor springs from a confusion of this ca: 

that of a correlation between two tests where the weight of the common factor is assume 

the same for both tests.) 


" AES Р m ave 
(ii) Product-Theorem. Similarly, on substituting for x and y from equations (i) and (ii), weh 


Sx Rc (67) 
ra= aa = WatWyb + WaeWye +... = rry + Faye + ++ n. 
(iii) Bipolar Analysis. Now let us suppose we have n tests dependent on the general OF n the 


factor, b, and on one supplementary factor, c (say), which we will first suppose to be common f all the 
n tests. Then, taking the total of the xth column of correlations and the grand total O 
correlations, we have (from (іу)) 


Zr па Web F Ре Phe 

V3Zne^ V(Enmyd (ny 75 a of its 

provided 2 rie = 0. Thus, for equation (v) to hold, c must be a bipolar factor, and the Slated, 
positive and negative saturations must be zero. This treats the saturation coefficient, 50 C T imple 
as a kind of average. Equation (v) gives the ordinary formula for factor analysis ا‎ onê 


summation.” It can obviously be generalized to include the case where there is more 
supplementary factor, like c. the 


11 
(iv) Group Factor Analysis. The working methods that follow depend on what I ож are 
* principle of condensation” (see above, p. 45). The factor matrix and the correlation M s of the 
first partitioned conformably, according to the grouping of the traits. The column-section ices of 
factor matrix, and the submatrices of the correlation matrix, are then summed. The mô relatio" 
sums so obtained I call * condensed ' matrices. It can then be shown that the condensed СО 


v) 


(k,k! 21,2,...x,... m ° 


r 

| facto 

1 There is no a priori reason why bipolar and group factors should not appear in the зай f the 
matrix. All that is necessary is that the extraction of each factor should reduce ех describe 


matrix by one : and this, as is easily shown, is a very broad requirement. The methods he 
can readily be combined. 
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Matrix (i) wi Fe 
actor O vill nave the same rank as the original table, and (ii) can be deduced from the condensed 
y partitioning the от the summed saturations. For, let Ra be any rectangular submatrix obtained 
Of the factor mat : correlation table ; let Ris = FFs’, where Fy and Fs are the corresponding sections 
WEE. where y and let w’ denote the summation operator (1, 1, . .. 1]. Then wy Кн = 
Semi-condensati € жЕ, and мо are the sums of the column-sections! Further, if W’ denotes the 
ion operator (whose rows consist of l's or O's), then Wo RW, = ШЕЕ И. 


(а) О, 

tests, LO Gre ee Меш Let us now suppose that c is common only to a limited group of similar 

the group factor c р T let us take the case of four tests only, with the first two tests alone involving 

the whole correlate xpressing the inter- and cross-correlations in terms of the factor-saturations, 

in Table XXII "on table, and the sums which result from its condensation, may be written as 
. ere the capital letters denote sums of factor-saturations, i.e., 


2 4 2 
7 B, = гь, B, = гь, Cy = гь: 
To find AS 1 3 1 
rst find the divis actor-saturations from the column totals (71283, ra(Bı + Bs). . ., еїс.), we must 
Or the South-East ts By and (B, + В). We can obtain B, by taking the square root of the sum 
two enh rant, viz, МВ? ; and we can obtain (В, + Bs) by dividing the grand total 
9 find tgs 
by multiplying sh factor-saturations for the group factor 
Second D Оро 088/6 factor-saturations, г and raj, 
ау. У, dependent solely on the group factor c, which c 


c, we must subtract the * hierarchy," formed 


from the North-West quadrant. This leaves 
an be factorized in the usual 


(b) T 
modis, , Wo Gri E 
ro ied to include Factors. Let us now suppose that the South-East quadrant of Table XXII is 
is to n Column-sus group factor of its own, whose saturation-sum is Da, The divisors for the 
s ^ determine the ms of the North-East quadrant will be B, and В; The simplest procedure 
» And then, if neci 0 B,/By provisionally (e.g., from the bipolar analysis), calculate C, and 

Th essary, find improved values for В, and Ba by successive approximation. 

Satura; ^ "ee Gri А 
Urations mite Factors. Let us take Bı, Bs, Вз to represent the sums of the basic factor- 
€ three groups. Then the sums of the non-diagonal submatrices will be B,B;, 

3 and В.В. ЕЕ 

2B. BıBs . В.В. L ype The divisor 


BB 
To find Ву, for example, we now calculate BBs 


[B,B; . BaBy 


9r the р 
first 
VERE [ [E column will then be B, + Bs = BB, XUI = 
ace 1B, Rp 2 
} ВВ. X 
Other q BB, + JR , which is the formula used on p. 48 above. Similarly for the 
Visors, B,B, J 


use the analogue to equation 


(9), (d) More 
on) instead of the 


„taki than Thy, 

"пуца the ene ГА Group Factors. For the general case 
al соед, 775 Of the coefficients in the submatrices (obt 
Metho, 
true Xact r, 


we simply а 
ained by condensati 


cients the 
mselves, 
hat the observed correlations are 


d of L 
east Squar 
esulta quares. The foregoing proofs assume t ved | Я 
ants of the basic and group factors. With an empirical table this will not be strictly 


апу рү, е 

еп test, hi iblv 2 hat the averages of its observed an 
as, Correlati ; however, we can plausibly assume t at the averag 
Su, Sume thay HONS are virtually the inc For example, taking test 3 in Table XXII below, we 


IC 
nj ently at the s а responds 
cl average correlation fax’ e enotes an observed value) corresp' 
eee orrelation 7.’ (where the dash denc Aet with the * true values 


RASY E 
a(R Sra =,» With the , " > lations ( 
апо pg = average of the * true ’ correlations Vs М н "eg 
i B). tU + B.) exactly. If, therefore, we assume ar = st ye тау а diro 

ntamount to supposing als left after 


Cie 
all 3 bet aj. 
th We € use of thi ^ 
to e fac oP the e above formulz is therefore ta t d 1 
а ine ote and the observed values for the several correlations бс ШЕШ ше Sea ui 
i thi en EA [ ations. З 
rm extracted) are the result of chance uer сынба EU provided 
: В С 
Is dece, į T of tests has been used. 
ll which the factors m based 
all (parti i is assumption is only Ver 
icular factors), this as: = 
Y true, H ly in the case of the group, cto be better to proceed by seeking 


n of partiti і 
tk partitioned matrices see 

°п, A. C. (1939), Determinants ап 
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үе ү шаг values for the saturations which, with any postulated factor-pattern,? will maximize 

Procedu 5 ea of the test-measurements or test-correlations actually observed. This leads to a 

sum will uM ich minimizes the square-sums of the residuals, instead of assuming that their algebraic 

y the t ezero. In practice, however, it is usually found that the differences in the results obtained 

Batun О methods are not large enough to affect any demonstration of the existence or essential 
€ of the factors so discovered. 


П. Calculation of Rotation Matrix 


Let B be the summed bipolar-matrix, G the summed group factor matrix, and T the required 


Totation matrix. 


Then BT = G, and GT’ = B. Writing the last equation explicitly and substituting 


po Values for G and B from Table XVIII A and C, we have 
17586 :0579 — .0000 -0000 aejp -8506 -1568 -1062 -1453 | 
1:5875 6299 0000 -0000 bfkq 1:7352 -4177 4628 —-1453 
17134 0000 -8244 0000 cgmr 1:6300 -4680 —-5690 0000 
"0000 .0000 1-4228 dhns 19681 —1:0425 0000 -0000 


1 2 Е 5 
Proviq Ower case letters are substituted for the several columns of 7". The above matrix equation 


Equations 
t 
quaniu (9 b 


With four systems of simultaneous linear equations, each system containing four 
e solved for four unknown quantities. Let us take the first column: of unknown 


Quantities’ namely, a, b, с, and d. Changing to the ordinary notation, the matrix equation for these 


may be expanded as follows : 


'8816a -+ -0579b =" 58506. ОТ) 
1-7389a + -6299b = 1:7352) 9. a ED) 
1:5875а ++ :8244с = 1:6300 ЕШ) 
Ultiplying f 1:7134а + 14228d = 1:9681 .  .  . (iv) 
Ying (i) by -6299/-0579 = 10-8791, we have: 
Subtracting r: 9:5910а + -6299b = 19-2538) ИУ) 
"acting (ij) from (v) we get : 
7-8521а = 75186. 
Subst, Hence = 9575. 
Stituting for a, we obtain a 


from (ii) 1:6651 + -6299b = 1:7352. 
from (iii) 1:5200 + -8244c = 1:6300. 
from (iv) 1-6405 + 1-42284 = 1:9681. 


Hence b = :1113. 
Hence c = ‘1334. 
Hence d = :2303. 


Iculated by this method, the figures 


uct, Бу verifying that the square-sum for each of the four rows amounts to 1:0000, 
Wh I Shoulg Sum for every pair of rows to zero. 
poe been © express my indebtedness to my colleagues, 


Dr. Banks and Mr. Summerfield, 
enough to read this article in proof, and check many of the calculations. 


53 


Satu teu < 

абое Ну о 

qe anions, reg Phrase * factor-pattern ’ to denote the general scheme of positive, neg: ye O iE 

(eg, Mounts Sardless of their numerical value. The full procedure would zeae PTO E 

Hd at о-оо ME maximum likelihoods obtainable with diferent hyphen fcror-patiern, 
Csi at a а i pattern, а "E T js 

WS морі structure pus Sener pisaha a R as provisionally proved, that hypo: 
Mor, fur, of group factors only), an: Ў 


ikeli г * "maximal maximum." 
shed the largest value for its maximum likelihood—a ргоо 


О for these v cases are given in the s and postulates 
LPB 24б ? Factor Method. “Кога fuller discussion of the initia ay р 
> Which incorporates the essentials from earlier mem 


FACTOR ANALYSIS BY MAXIMUM LIKELIHOOD : 
A CORRECTION 


By D. N. LAWLEY 
The Mathematical Institute, University of Edinburgh 


Tn a recent paper (1) Mr. W. G. Emmett has quoted formule for the standard 


errors of factor loadings estimated by my method of maximum likelihood. > 
use of the formule, however, requires an additional qualification, since the derivation 
given in the paper (2) from which they were quoted is itself in need of correction. А 
It has now been found that the estimated loadings obtained for a given Se Е 
tests in one factor are slightly correlated with estimated loadings obtained for s 
same set of tests in another factor. This in turn means that the standard ean 
given by the formula for the estimated loadings in the rth factor are correct only 
if the values of loadings in previous factors are regarded as already given. | 
I should add that this correction in no way detracts from the value of Mr. 
Emmett's own paper. 
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TESTS OF SIGNIFICANCE IN FACTOR ANALYSIS 


By M. S. BARTLETT 
Department of Mathematics, University of Manchester 


I. Descripti 7 

iption and Illustration of a Test of Signi is i inci 
С, of Significance for Analysis into Principal 
omponents. II. Remarks on Lawley's Maximum Likelihood Method. II. nathan 


Theoretical Discussion. 


I 
ME SCRIPTION AND ILLUSTRATION OF A TEST OF 
NIFICANCE FOR ANALYSIS INTO PRINCIPAL 
COMPONENTS 


РЕ ооу Remarks. In а previous paper (1) in this Journal I distinguished 
i bos WO types of statistical analysis, external canonical or factor analysis between 
ê N of variables and internal factor analysis of a single group. By the latter 
Mm en the particular statistical analysis of the internal correlation structure 
ШОН роге familiarly as an * analysis into principal components. It was shown 
analysis ppropriate tests of significance of the * factors ? emerging from the statistical 
Was Vail b шо distinct in the two cases; but while a fairly satisfactory test 
SEI ined e pr the external case, no analogous test was given in the internal case, 
smallest T of assessing the significance of the smallest root but one against the 
describe a is the main purpose of the present paper to remedy this deficiency, and 
irm more general test analogous to the test developed for the external case. 
an anal X d be noticed that an analysis into principal components 1s distinct from 
i dom corresponding to some postulated psychological pattern, and has a 
iS perha рев ам Тһе most systematic attempt to cover the latter problem 
critica] disc at by Lawley (2, 3, 4), using the method of maximum likelihood. A 
associated БОП of this method is given ın Part IT, where it is shown further that the 
technique X test proposed by Lawley in 1940 has a partial connexion with the y? 
Ormulz ELE ee in І. For convenience the theoretical justification of the 
Which ma ашы described in І (and, to some extent, in II) is deferred to Part III, 
4 3 y be omitted by the non-mathematical reader. 
either reet paper does not entirely clear up the problems which it raises for 
to be worth of analysis; but the -procedures discussed seem sufficiently justified 
is still oa ne at this stage, even although further theoretical investigation 
descent X? Approximation. Let us denote by X (i = 1, 2, . . ., p) the p roots, in 
ng order of magnitude, of the determinantal equation 
|R—3| #0, (1) 

It should be noted that for the 
Let the total number 


Where 
Ri , B» i 
determinant 4 correlation matrix of p variables. 

| R | we have the relation | R | = 1 А: p. 


Bree 1 
7 E Of freedom available from the original observations ben; We shall usually 
с nean are employed for each 


Vari ا کے‎ ble 
able 1, when deviations from the general 1 1 P 
Present ¢ Where ^ is the total number of observations for each variable, i.c., in the 
1 Ontext r is the total number of persons tested." The statistical significance 
i er was defined as r —2. 

vor ds but the degrees of 


mote that the ae 
ree iS the yg Lumber of degrees of freedom s of Table I of the pr 5 
dom n Sn convention for the test of significance of a single correlation, 

A 1 for each variable is more appropriate here. P. 
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of the entire correlation structure may be tested if necessary by calculating the 
quantity + 

xX = —{n — 40р + 5)} loge | RI , (2) 
with 4p(p — 1) degrees of freedom. However, the usual situation will be that after 
the removal of the largest roots we require to test the significance of those left. The 
y test may still, as in an external analysis, be used for this purpose, provided that 
the roots removed correspond to well-determined (i.e., highly significant) factors. 
We now take 

x? = — {n — ‡(2p + 5) — $k} loge Крв, (3) 


with 3(p — К)(р — k — 1) degrees of freedom, after k roots 2; 29 . . 
been determined, where 


Ry-# = |8] H irae So [EM | p-k |, - 


It will be noticed that the test is formulated to indicate the significance of the 
residual roots ; however, it is often convenient to present a complete y? table, pro- 
vided this is properly interpreted. Since the multiplying factor in (3) changes with , 
the complete table is not strictly additive unless a constant factor is substituted + 
we might employ the smallest, corresponding to k =p —2: the factor 1s then 
nı — p + 3. But if nis not large, the full advantage of the approximation is retaine 
if the proper factor is-used for each individual test. 


», have 


Numerical Illustrations. Two examples will be given to illustrate this test. For [5 
first I have taken the data of Truman Kelly's quoted in Hotelling's original discussion G 
, readin, 


of an analysis into principal components, consisting of four tests (reading spee 
power, arithmetic speed, and arithmetic power) carried out on 140 children. For 
R we have ? 


the matrix 


1 0-698 0:264 0-081 

0-698 1 —0:061 0:092 

0:264 — 0:061 1 0:594 

0:081 0-092 0:594 1 
The variances taken up by the four roots were 

м = 1-846 

hy = 1-465 (RI = XX» 

As = 0-521 = 02353 

ым = 0-167 

Total = 3-999 

For the quantities Арл in (3) we find 

К, = 0:521 x 0-167 x (2/0-689)? = 0-7330 

R, = (0:2353/1:846) x (3/2154? = 0:3444 

Ry = |R| —02353. «e DE 
In practice when the smallest roots have not been evaluated | R| must of cour ain 
evaluated directly. Using the common factor n — p + 3 = 139 — 4 + 3 = 135:5, we 


the additive y? analysis : 


imatio? 
1 As noted in Part IIT, the criterion | А | was considered by Wilks in 1932 ; the X? aproxima fot 
proposed in equation (2) is closely connected with the 7? approximation I have put forwa 
external analysis (see also Box (11) p. 344). 4 for 
2 Тп order to make use of Hotelling's analysis for this first example these correlations, correcl ле 
attenuation, have been used, though of course this correction will affect test of significance 
(unknown) extent which is here ignored. 
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| 
| DF. | y: 
м | 3 | 135-5 (loge 0-2353 — loge 0:3444) = 51-6 
» | 2 —135:5 (loge 0:3444 — loge 0-7330) = 102-3 
xm | d — 135-5 (loge 0-7330) = 421 
М | = 
Total | 6 | —1355 (loge 0:2353) = 1960 
| 


E ber approximation for the total y? would actually be — (n — 40р + 5)} loge | R| = 
that thi Я (loge 0-2353) = 198-0, but of course all the components аге so highly significant 
gives is difference is unimportant, It may be wondered why the second largest component 
logic, 18 Apparently more significant 7? than the first; but it should be remembered that 
So Y the significance of at least one root is to be judged from the total x, not the 
to Оле Corresponding to A,. The apparent significance of this component according 
remai; above table represents the magnitude of the root compared with the variation still 
appa ning, and the latter contains quite a large second component. On the other hand, the 
Tent Significance of the latter according to the analysis of the table is judged in relation 
Variati residual variation after this component has in turn been removed, and this residual 
P comparatively small. . ч И 

Which "s lé second example I have taken the hypothetical data discussed by Burt (6), for 
Пе number of persons is only 6. It is indicated in the thcoretical discussion in IH 


tha { п 
nus "ау expect the test presented here to work satisfactorily, at least for the total у °, foras 


low Д 
asa c А 
Bouton оош 10 degrees of freedom.! This means that the test must Бе used with considerable 


п the 3 Xa ich i istic in its very small number of persons. In 
View present example, which is unrealistic in its very 1 ЖЕ 
form Of the small value ua the 7? table for this example will be presented not шеа 
E signifi in the form in which it would arise as each set of residual factors is tested in tur 
Шсапсе, the best multiplying factor given in (3) being used in each case. 


‘om Table ХШ of Burt’s paper we have 


4 = 31021 
ме: | R| = 3 de As Da 
^, = 0-1593 = 0-002101 
X, = 0:0058 
Total = 40000 
Hence 
Ra = 0.1593 x 0:0058 х (2/0:1651)° = 0:1356 
Ps = (0:002101/3-1021) x (3/0:8979) = 0:02326 
hey? tect pF RI = 0002101 . 
~ “St for the residual roots is thus : 
y: P = 0:05 
D.F. X level 
[1959.2 
= 17:47 | 125 
12A, | 34241 | —2833 (loge 0-002101) = 1753| 7.82 
چ‎ 24-1 2.167 (loge 0:02526) = ба 3:84 
j 1 | —1-5 (loge 0-1356) - 
d f ns, 
"or ese d spending on the number of persons, 
е de, CBTees of f, i f course the number Череп нр 
Brees of fied o Tune nadlag only on the number of tests: 
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hus able to conclude that the first two factors are 


Even with such a small sample we are t 
lusion in the final 


significant ; the third, however, does not reach significance (cf. the conc 
paragraph of my previous paper (1). 

Further Comments. It need hardly be stressed that I am concerned here with 
statistical significance. It does not follow that all the factors which reach statistical 
significance in a large sample necessarily remove a very large fraction of the variance ; 
and hence some of them may be comparatively unimportant in practice. Again, 
even if they are numerically important, this has no necessary implications of psycho- 
logical or other reality of the factors. Merely the correlation structure of the 
variables is being investigated in its relation to variance : for this reason no significance 
can ever be attached to the last root, for it would be equivalent to asking for the 
correlation structure of a single variable. 

If of course the basic equation (1) is replaced by another, then the above tests 
become inappropriate ; for example, if we ask for the significance of the factors 
of an equation like 

| R—3ei| =0, (5) 

where e;; denotes error variances (assumed known), then, as I showed in (1), the 
analysis is a special case of an external factor analysis, and other tests become relevant. 
Or if in place of the empirical analysis of equation (1) we substitute a factor analysis 
of more orthodox psychological type, then, as noted in the introductory remar 8, 
other tests again will be required. Some discussion is given in Part II of a factor 
analysis of standard type, which in vector and matrix notation is represented by the 
equation 

T = MoFo + МР, (6) 
where T represents the vector of standardized tests, Fọ the set of (standardized) 
general or group factors, F, the (standardized) specifics, and (My, M з) the factor 
“loadings. Since the difference between the factors of equation (6) and those 
corresponding to the analysis (1) is not always realized, it may be advisable 
to illustrate this difference by a simple example. Consider three tests E 
which the first two are correlated with coefficient р (positive), and the thir 
is entirely independent. On the basis of equation (6) this might be interprete 
as indicating one ‘ general? factor common to the first two tests, but with Zero 
loading for the third, the remaining factors being specific to each test. On the hes 
of an analysis into principal components, however, it is easily seen that the r00 1 
are 1 +p, 1 and 1 — р, and it will be noticed that in order of contribution to юла 
variance it is the second root which corresponds to the uncorrelated test anc, aa 
any reasonable size of sample, this will be significant compared with the last. Th 
once significant correlation is present it may automatically cause all the roots to ad 
distinct in a principal components analysis, which should not be assume to le 
necessarily to the simplest factor-pattern. 


II. REMARKS ON LAWLEY’S MAXIMUM 
LIKELIHOOD METHOD 


; e 

The Maximum Likelihood Method in Factor Analysis. To determine a 
structural factor equation of the type represented by equation (6), Lawl? the 
investigated two methods based on the principle of maximum likelihood. jing 
first, which he has called * Method I, is based on the variance-covariance $4 E 0 
distribution of the test scores ; the second, * Method II,’ is based more directly uld 


the individual test scores (see his paper (3) ). At first sight the second metho 


80 


i — 
ap RR P 
—— 4 mpm a E PED JP" 


M. S. BARTLETT 


appear the more fund it i 
paper (7 amental, but it is known to lead to difficulties. 
EC М. O: Kendal raised doubts on the suitability of the a A 
entertained E е, context ; and Dr. Lawley has informed me that he had alread 
following КОП. doubts, at least as regards * Method IL' In the Ree 
to another set B leche I pointed out that the factor equation (6) is equivalent 
definitely Bu structural equations (familiar in econometrics) for which it is 
The questio hat the standard maximum likelihood method breaks down. 
detailed B n remains of the validity and status of Lawley’s * Method I,’ and a 
or dealing оа and formulation of the most appropriate statistical procedure 
comments on fone structural equation (6) seems still required. Some provisional 
Lawley’s * ms ey’s first method may, however, be useful. 
Value of this ethod I; and its Associated у? Test. I was at first sceptical of the 
example, it tod because of its obvious breakdown in elementary cases. For 
Should then vid not work for the case of two tests and one general factor, for we 
‘our BPO, ve reo observed quantities, the two variances and the covariance and 
Standardized n loading coefficients (or equivalently, when the tests have "been 
he general CN: observed correlation, and two unknown loading coefficients for 
us to the plo tor) ; in other fields this case often arises in the guise of fitting a straight 
aret (8)). of two variables both of which are subject to error (see, for example, 
n 
to A apos d the number of tests is usually lar 
Covariance di he validity of the maximum 
istribution then depends (i) on our deci 


ion ¢ 
Ontained i B 
in the variance-covariance (or equivalent correlatio: 


o ty of the А ven iding efficient estimates. 
Nger auto maximum likelihood approach can no longer be assumed, (ii) is no 
demonstration. One possible argument is 

lways be introduced to 


to n 
ote tha; 
tak t enough factors and unknown coefficients can à 
id redundancy ; representation 


© up all : 
f 6 zr available degrees of freedom and yet avo! 
ation matrix by still fewer parameters lies further linear 
i is should hardly affect the question 


°Шсїепс: 
com Pati Y when the likelihood method is used. jy 
iti Patible wah e noticed that the method depends on the correlation matrix being 
5 the postulated factor-pattern. For example, with only three tests 
except in the case o 


r equation (6) 
11 the degrees of freedom. Yet the 


10п matrix 
1 0-60 —0:28 
1 ( 0-60 1 0:60 ) 
'S only ¢ —028 0-60 1 
eneral factors. This would have to be 
i and not by Lawley's 


r o Р i 
азый ов with a minimum of two Р 
Sho ate me ations impossible to satisfy with real coefficients, : 
md test th est. If the above matrix а mple, it is no 
With еу Lu us of fit of one genera t 
i т Uation (6) appears to provide a satisfactorily 
a : 
огап T ded stressed that this factor analysis is quite distinct from 
eco. C Gare paper; butitis an attractive link between th x 
Sarily th sponding to the significance of the unreduced correa 
diffe, wo a € same, and only because of the differe 
d nalyses does the analysis of the total 7: 


t clear how we 
uch difficulties, 


lfactor. If we ignore 5 i 
objective procedure for dealing 


nce betwee 
2 into its respec 


Tests of Significance in Factor Analysis 


It follows (see the last section of Part III of this paper) that the form of Lawley's 
test which should be more exact for moderate-sized samples is the determinantal form 


x ~ пов | R|/| Ê], 


where | R | denotes the estimated correlation matrix when К general factors (and 
р specifics) have been fitted, and the multiplying coefficient л” be taken, not to be n, 
but the coefficient n — 3(2p + 5) used in (2), or perhaps even better, the coefficient 
n —iQp 4-5) — 2k used in (3). The number of degrees of freedom of y? 15 
Жр — Ю(р — k — 1) — k (subject, of course, to the difficulties raised above). 


III. FURTHER THEORETICAL DISCUSSION 


Equivalent Analyses of Correlation Structure. Any resemblance of the test 
described in I above to the test for external analysis described in the earlier paper 1$ 
more than superficial. Let us consider the logical relations among a group wd 
variables, say, four for definiteness. The possible correlations may be considere 
in more than one way. Thus we may consider the correlation between variables 
x, and хз, then the relation between x, and the group ху, Xs, then between x4 an the 
group ху, X2, хз. As another alternative, we might consider the external relation 
between the two groupsx,, x, and x3, x, ; then the further internal relations betah 
x, and x, in the first group, and between хз and x, in the second. The interna 
relations in a single group can thus be regarded as a number of * external’ relations 
between various subgroups. Hence a test of the internal relations can be built uP 
from the tests of the equivalent external relations between sub-groups. If we ш 
out this theoretical programme according, say, to the first of the alternative analys 
mentioned above, where the relation of each further variable is considered successive у, 
We arrive from the theory of the 7? test for external analysis (9) at the successive X 
quantities : й 

— (n — }(3)} log. (1 — r2,), (1 d.f); 
— {n — }(4)} log. (1 — RE, 1, Q df), 


—(i—4»-D)og(-—mR8R,n...,.9, (p—1di): 
where ry, is the observed correlation between x, and x, Йе, иык = 2 He ШОШ 
correlation of x, with x, , xs ,..., Xp—1- In order to make апу test 
invariant to the route of arrival, it is obviously convenient to choose a mean here 
plying factor, which (weighting with the degrees of freedom) becomes л — af, м 


ipp — Df = (1 x 3) + (2 x4) + (3 x 5 +... (р — D(p - D: 


or 


| S= +5). 
It is well known that ш 


U — ril — А8, 29)... (0 — RE asses) = | RIS al 
whence we obtain as an approximate у?, with 1 +2 OE (== bp? 
degrees of freedom, the expression proposed in equation (2). t 
Direct Derivation of the у? Approximation. The above method of approach we al 
one I used first to arrive at (2) ; but it is satisfactory that the test finally arrived at arp 10 
on the correlation determinant | R |, a criterion considered many years ago Dy | vil own 
It is useful to check directly the appropriateness of the X? approximation by writing mul? 
the moment-generating function of —7 loge | R| , by suitably adapting Wilks’ p 
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for the 
e sth ord i 
analysis). ee онер of | R | (cf. the derivation in (9) of the у> test for an external 
Simultaneous distributi y the moment-generating function could be written down from the 
ion of the correlations R, or from the independence of the components 


listed above, We obtain 


T2- (4) T pan-—1)-— nt) 
(7) 


Me P-1 > 
Е Te? Qa — ni) П Г — 0) 
апа it is а matter of strai ici 

Or a [function to Сави aan algebra coupled with Stirling’s approximation formula 


M(t) hele m к 
E rai 2i „Г... 42i it i 
Баара аира. ] ee [EP 


: gn 1 з 
+ аро - 88 [1 — چ‎ + 2] Pesi (8) 


rst 
5]: Hence the improved approximation is 

firming the y3 ~- {п — 40р + 5)}loge | RI > Р 
ing арр — mean multiplying factor arrived at by the previous-argument, and from (8) 

BS Сілен; ) degrees of freedom.' 
Eat be sed the у? Approximation. The order o 
is Sequently b Table to that for an external analysis, in view 
or dom (9); 1 expected to work quite well do 
Tor Single Жз. us, for example, in the case о 
У correspondia is involved, check the closenes 
ing to the correct critical P — 0-05 or 


Con: 
hay; 
f closeness of this approximation will 
of their relationship, and will 
wn to a sample with about 10 degrees of 
f only two variables, where merely the test 
s of the approximation. We find 
0-01 value of г», the values : 


Р = 005 Р = 001 
n = 10 3-84 6-60 
п = 20 3:84 6-63 
п = o0 3-841 6:635 
the larger 


тесі of eliminating 


The approximate е : 
ossible to give 


sidered. To some extent it is possib 
liminated are 


ent a 
t analogous to that used for an external analysis, when the roots € 
sponding to these roots 1n the 


Become, Min Jj 
deten, rical ed, In this case the orthogonal directions corre: 
ne ined, I of the variables in the n-dimensional sample space will be well 
lMension: we may consider the analysis of the remaining p — k roots in à space of 
he aq; OPS (if K roots have been eliminated). 


adjus 
Ы tment of the multiplying factor gives 
S o: 
Biven į @= = 2ф —H +5) 1 1 
iven in (3) is 
ts analogous t 


merely à convenien 
i =A 0 


Eius given | R | and м Лг... $ lative V 
ernal analysis can only be on the basis of the relative "his explains 
чеп 


Pak 

EO, A wa a pF Ê in Rp 

-k 

| xil. d "n 
dap” inde Ay Ag... Ak 1р = п. 

ер Реп " i 

On Such ont and prior mention of this approximation is included in the recent comprehensive 
Proximations by Box (11). 
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Tests of Significance in Factor Analysis 


It must be admitted that the above justification of the proposed test for the residual 
roots is not as complete as could be wished. The problem would be slightly simpler if we 
were alternatively considering the analysis of test scores known to have true equal variances, 
but standardized to unity only for the mean variance. In this case the above form of the 
test for the residual roots can be given further justification, but of course the total у? now 
has p — 1 more degrees of freedom, these representing the test of homogeneity of the p test 
variances about their unit mean variance. Correspondingly, the total 7? for the last p — 
factors would have (approximately) (p — k — 1)(р — k + 2) degrees of freedom instead 
of 3(p — К)(р — К — 1). One point that remains in some doubt pending a more detailed 
examination is whether the reduction in degrees of freedom that ensues from the individua 
standardization of the tests is automatically felt in the residual factor components (an 
assumption implicit in the proposed test), or is mainly absorbed by the larger roots. 


The Alternative Forms of Lawley's у? Test. In his 1940 paper (2), Lawley gave 
two alternative forms for the large-sample y? test he developed for use with Ius 
maximum likelihood method (Method I). One was an arithmetically CORE 
sum of squares, but it is worth pointing out that the equivalent determinant formu i 
arises more exactly than he perhaps emphasized, and should therefore be the mor 
accurate one for moderate-sized samples. NUR TS 

To demonstrate this, it is convenient to define a ‘ homogeneous ' likelihoo 
Шоп from the variance-covariance sampling distribution by its logarithm ^» 
where 

— 2L + constant = n log | C | —n log | A | + n trace (C714) . (9) 

In this expression, where C denotes the true variance-covariance matrix and 4 Ta 
corresponding sample matrix, —2L tends to y3 with 2р(р + 1) degrees of freedom 
n increases. To determine the constant in (9) we substitute the estimates for Cs 
giving 

constant = п trace (4-14) = пр. (10) m 
If we alternatively substitute estimates of C from (efficient) factor estimates, wit 
q independent unknown coefficients, we obtain 


—2L + constant = n log | € | —nlog | 4| +n trace (C714) , (11) 
with 3 p( p + 1) — q degrees of freedom. 
From (10) and (11), 


—2L = —n log| A|/ | €| + n trace (C- 14) —np р 
= —п 108 | А | / | C | + n trace (6-14 — €-1€) 
= -nlog |4 1/11, 2 


ful ^ 1 i tes 
the last term vanishing when C corresponds to the maximum likelihood estima 


(see Lawley (2) ). ресїйб 


If no general factors are fitted, the estimates С consist simply of the $ FIL 
factor variances coinciding with the test variances, and (12) reduces to ~7 log | cip 
It is then, as we should expect, identical with the total у? proposed in the prin con- 
components analysis, and the modified multiplying coefficient n — 1(QP +>) 3° may 
sequently more accurate. When k general factors have also been fitted, (12) nie 
be expressed in terms of the ratio of the observed and estimated correlation deter ore 
nants. The more accurate multiplying coefficient n — 1(2p + 5) should stil P i 
appropriate than the crude coefficient л, and the further modified СО© 
n — (2p + 5) — $k would have some justification here also, though ees 
refinement is of course not very firmly established. The number of deg” ones! 
freedom comes out at 1(p —k\(p — k — 1) — k, the number lost рег сотр 
being one more than in the principal components analysis. 
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ally, it might be noticed that an alternative analysis of the total y? along the 

E А ep in the first section of Part III would sometimes be of practical interest, 

i p T was there pointed out, it would then rest on the known y? approximation 

DS ha H пус such an analysis would have a somewhat more precise justifica- 

ERES it has yet been found possible to give either for the principal components 
ysis or for the maximum likelihood factor analysis. 
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A METHOD OF STANDARDIZING GROUP-TESTS 


By D. N. LAWLEY 
University of Edinburgh 


Introduction—A method has been given by Thomson (1) for standardina 
group-tests, by means of which the scores obtained by a complete EU ы 
children: іп a test can be converted into “intelligence quotients’ or LoS Hte 
I.Q.s are not true quotients ; but are similar to Binet I.Q.s in that they make ТИ, The 
for difference of age, and are distributed normally about a mean of | ; agree 
standard deviation of group-test I.Q.s has for convenience been fixed at 15, to 2 
very roughly with that of Binet LQ.s. ihe 

The method of standardization depends essentially on finding for each month орава 
5, 16, 50, 84, апа 95 percentiles of the distribution of *raw Scores." Straight lines à 
fitted to obtain for each month the estimated value of each of the percentiles. spond- 
that for any given age within the range considered we can estimate the raw scores S other 
ing to LQ.s of (approximately) 75, 85, 100, 115, and 125. Scores corresponding 
I.Q.s may then be estimated by interpolation or extrapolation. ! and 


Я Bac . r some 
author developed a somewhat different method of standardization. This ШИ Де results 
years been used to standardize Moray House group-tests, and has been found to g 
very close to the former method, with considerably less calculation. to 


id en 
Description of. Method.—In developing the method the main idea x hi 
make the calculations as simple as possible, without too great loss eroe ur 
ction 
materially to the time required. Errors of estimation arising from the ا‎ 
of the conversion table are in most cases negligible compared with the 
measurement of any mental test, however good. f children 
We shall suppose that data are available for a complete year-group O sand. 
in some area, and that the total number of cases is large, say, at least a thou months 
will be assumed that the age a of a child is measured in units of complete e may 
We shall thus have twelve Score distributions, one for each value of 24 -grouP- 
further assume that there are approximately the same number in each month age 4 15 
Let us denote by x the * raw score" of any child ; and let us suppose that ue score * 
measured from the mean of the year-group. It will then be presumed that the = 
depends only on the age a and on the I.Q. y of the child, which is, however, unkno the form 
we may write x = o(y, a), a relation which may alternatively be expressed in 
у = F(x, a). 4 іоп 0 
Now it is clear that there is по logical necessity for F(x, a) to be a simp imation, 
xanda. In practice, however, we may usually assume, to a sufficiently good АЕР S co t 
that y — f(x) — Ba, where f(x) is some function of x only, while B is a p a wa the 
not dependent on either x or a. Furthermore, the І.О. y is to be chosen in suc 15. gine 
it will be distributed normally with a mean of 100 and a standard deviation o D 
Дх) = y + pa ; С mean 
it follows that, for a given value of 4, f(x) must be distributed normally with а t 
(100 + Ba) and a standard deviation of 15. Е t of ais ny 
hen a is not fixed, the distribution of f(x) will not be normal, since tha norm 
normal but rectangular. We may nevertheless treat f(x) as being very nearly 
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distribu: 

ted, si 

folal varie» Since the term Ba i 

riance су? m Ва is small ; and as y and ietri 7 

уап of of f(x. ae dias yiana d are distri 

da respectively. "is 8 ш Ве + Boa", where oy? EA т. Ше 
в oy? = 225 and са? = 12 (since the range of a is 12), we thus Ee 

» ve 


of = 225 282 
+ 129°. (2) 


It is fi 
of x, s rst of all nec i 
‚ Say, x essary to estimate th i 
Of кау» Xo, aiid ent e constant 8. To do th i 
en я Mee п each value of a, i.e., for each oo EE err 
do Г, ا‎ € han Xo. For greatest accuracy it is desirable te diae. Че 
aos ortesponding are ШП of the total score distribution. We then find for each val е 
за 9 mean re d in such a way that p is the probability of a а 
or each a e le I will be found Шш the angeo toz ШШШ 
Б ch value о anti 2 
constant x ing to a атр шап Ноз 15z m now provide an estimate of the LQ. y 
irat. Сазга: I Xo. er to obtain the relationshi yi af 
MESS . The wis line to це twelve values of z by the EA E en pd 
elve consider line when multiplied by —15 gives us a i 
and fo Month-groups. Ше total distribution of raw scores found by ааа em 
e scores will usually be grouped in intervals of 5 or 10 marks 


of child the sco: 

dren gee Corre: i 
plot p en in the oe to the top end of each interval we find the proportion P 
year-group obtaining less than this. We may then, if we choose. 
Iting points. In actual practice 


it is against x 
eas; x and dr 
sie d draw a free-hand curve through the resu 
f smoothing the values of P. 


asier to 

0 emplo А 
г à set S edis a simple, non-graphical method o 
paced values of x we thus obtain corresponding proportions P, 


Con May t 
verted j en be 
conv i : 
into erted into norma Zi 4 
z. ormal deviates i 
For these vates of E die d the same manner In which p was 


Wher 
Е Dr İS as given b f(x) = 100 + oZ, 
age тозлу А тев (2). Тһе теап and standard deviation of f(x) will then 
га, п by a = 0 Ve may regard f(x) as the I.Q. corresponding to а score x and 
‚ Since in dealing with the total distribution of score we have in effect 
ing intervals f(x) 


(3) 


ation. In theory it might bi 


Prob] Пе а, 

est; em assumpti E 1 

timating tS EF Ption regarding the mathematical form of f(x) This approach to the 
d owing to the difficulties i 


(1) , TO fing unk 
)and nad the е parameters thereby involved. 
* Hence rresponding to any pair of values of x and a we use both equations 
le yi y = fix) — ba 
= (100 — Ва) + 92. (4) 
owest value of a 


төс; 
Quireq? found convent 
" 
ay nient to calculate the Т.О.5 Yı corresponding to the 1 


»a 
Ro 1. Thus 
то 
һег Values of 33000 = Ва) + oZ. (5) 
= à the I.Q. y is then given by 
Peat’ 
Riv ney, у= у Bla = а). (6) 
f standardization à numerical example 
plete year-group for 


à Nen b ical Ey 
e ха! y Р 

2 low, Dd To illustrate the method o 
тоб fanis толо represent the score distribution 
Wo v. 5€ from Tor The ages are given in years an 

Tad P^ 2 = : 8 (10 years 8 months) to 11 : 7- 
xy ably i espectively chosen, namely, 49:5 and 79:5 ; these correspond to the sets 

y. The slopes of the two lines found by the method of least 5907 

i e therefore estimate 


035 lttle - 
DAP d 
b сс e Mum E and — 03514 respectively. 159.58) rs 
0 Dair of the gE t alue of су is then given b: RE 235 F 120528) rr Buts 
; gin ipl the Scones conversion table it is eost to use the Z valu onding to the 
The di ues p intervals, namely, 9:5, 19-5, 29.5, elc: We therefore in trapolate 
lffStence 2, 10, 20) 5 90. Using equation (5) we then fin the 1057 , for ag 
between the lowest age 10: 8 and the median age 41:14 is 
a 
87 


s of a com 


d completed months at date of 


of values 
ares differ 
p as 


izing 


di 


A Method 


of Standar. 


Group-Tests 


LV а LM кп OH вш сл сш Ol 66 6 $6 06 з 98 сз GL у 
т ш в vil — HI вог о чш 10 6 L6 $6 t6 6 68 98 а в SL 
81 її п т IH — 60i эх ют сог 001 $6 96 — *6 16 в 18 %8 б SL 
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H š 
ence, putting a, = —5-5, we find that 
i 100 — 0-528 x (—5:5) + 15-117 
Lr = 10290 + 15112. 
ages the I.Q. у is obtained by subtracting multiples of 8, according to equation (6). 


Fitting of straight line for xo = 495 


350 —:128 478 11 5:258 
225 — 118 343 9 3:087 
166 —-060 226 7 1:582 
:156 “110 046 5 230 
126 -098 028 3 084 
—073 -090 —-163 1 —163 
`950 —-008 958 10-078 


Slope = —10:078/286 = —:03524. 


= Values of Z and I.Q.s for age 10:8 


З * 7 x 2 1.Q.* for 10:8 (уу) 
95 (133-00) 
89-5 1:589 90 1-621 {зу 
85 82 
795 1-045 80 1-068 ҮШ 
75 . 
025 674 70 -691 nun 
55 :356 60 371 TM 
E ү 
Эз -080 50 093 0921 
45 : 
395 —479 40 166 10039, 
35 { 
К" — 459 30 —444 ED 
E —+772 2 —'7152 Guan 
26 —1:243 10 —1215 84-54 
* 
Figures ing 5 (80-00) 
che In Tackets are interpolated or extrapolated. 
gks e nearest whole number. Various 


the со 
of 4&5 May pa Vefsion table I. iven correct to th 9 
m4 апу 00 made on the Besse: ir m we may note, for example, that the median score 
Cent otages edian age 11 : 12 d to an 1.0. of 100, and that in each month-group the 
“and 9f child г dpcouspono 2 15 he expected values of 50 per 
16 ren with 1.0.5 of over 100 and 115 (say) are near the exp 


T cent. respectively. 
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LINEAR AND NON-LINEAR 
DISCRIMINATING FUNCTIONS 


By A. LUBIN 
University College and Maudsley Hospital, London 


а ds litative 
I. Problem. П. Mathematical Derivations. ПІ. The Prediction of lm 
Attributes from Qualitative Categories. IV. A Non-parametric Test of Sig 
V. Summary. 


I. PROBLEM 


" ork 1n 
The main purpose of this paper ! is to present a summary of 7 БЕ aon to 
discriminant functions. Discriminant functions represent a fairly b ups they are 
the statistical techniques that can be used by psychologists. Poe e mental 
Solutions to problems in prediction, Given certain measures of BAI 9 Can Wê 
traits, can we predict that occupation in which an individual will peer pattern of 
state that therapy is most likely to benefit patients who have a ae attribute 
Scores оп a personality test? In these cases we wish to predict a спа function і 
of an individual from known quantitative variables. The discriminan n individua 
used whenever there are a set of mutually exclusive classifications and a 


jtative 
\ А uantita 
must be assigned to one of these classes on the basis of a standard set of q 
Scores. 


In 
пе. . 
m " ewm А it was given а name ive 
Historically Speaking, the discriminant function was used long before it was £s of quantita t 
"she 2 int AINE ups (L^ ime 
variables, which is used to assign an individual to one of two mutually exclusive group Шш 


: à BS CHOI itive JUS at qne 
distribution, the assumption of a point distribution becomes untenable. But defini p that 


criterion), the multiple point biserial 
function of R. A. Fisher were all mutually proportional (4, 5, 30). 


8 jn 
lace J of 
Properly to appreciate the importance of the discriminant function we me, types m 
its setting among other prediction procedures. Table I illustrates four pos pre jc frois 
simple prediction problems.: In the last line the problem is raised of how id dons r (t 
one set of qualitative variates to another. Although there are special solu sta 


ае : been 
problem in terms of quantifying the qualities, the general solution has not yes 100 


in purely qualitative terms. One solution is considered in Section III, рр. 99 
Л in the told, 
* As will be seen, the writer is indebted to Professor Sir Cyril Burt for many of the 1885 Summ! fisen 
cknowledgement must also be made to Dr. M. Hamilton, Mr. A. Jonckheere, Mr. 


У sal who рау 
Miss D. Webb, and my colleagues at University College and Maudsley Hospital W 
invaluable sources of stimulation and instruction. 


pee 
* Somewhat similar classifications will be found in 4 and 16: cf. also this Journal, 1. Р! 
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TABLE I 
Dependent Variate | Independent Variate(s) | Method of Solution 
1. Quantitative Quantitative .. .. | Multiple Regression 
2 Quantitative... | Qualitative . | Analysis of Variance 
ax Qualitative | Quantitative 3 | Discriminant Function 


4. Qualitative 


| Qualitative | 3 


a quay most users of analysis of variance have never thought of it as a method of РШЕ 
dels ative variable from опе ог more qualitative variables. However, the simple probability 
each cud In setting up the equations of analysis of variance postulate a quantitative effec 
евогу of classification. They can, therefore, be regarded as prediction equations. 


imei: acing discrim; 2 2 alvsi i d multiple regression 
i 8 diser: a alysis of variance an ple r 551 
plies that th iminant functions in the same table as analy In the writer's opinion 


this discriminant technique is of the same order of importance. rs opi 
function Ue. Тһе F vocational ional guidance alone should find the discriminant 
unction ap ; field of vocational and educational guid: ane should ша e O 


We will no dispensable t Il fields of empirical enquiry, 1рћ 

sienio no longer ask АЧ e Med em * Ате Ке differences between the groupe statistically 

Used Ror he new questions will be, “ Are these differences of any practical use? Can they 
allocate individuals to their proper classification ? 


II. MATHEMATICAL DERIVATIONS 


titus Le Prediction of Quantitative Variates from Qualitative RUE e s 
Drop," ЇЇ, analysis of variance is the one-way classification. 1 intelligence 
e М İS а question whether a single quantitative attribute, such 48 me i 
Party, Predicted from a single qualitative attribute, such as membership in a p 
Пе basic equation is : 
e P Il 
Sant i the overa 
lass is equal to Y 
6 . Thus 4; is the 
best estim th class (47) plus some positive or negative error oa individual comes 
ftom y Mate of the score that can be made from knowing that 1201 timate of A; will 
iousty E Class. Тһе error of prediction is Es The ne ioe will à denoted by 
p € the mean of the n; individuals in the jth C255 \ we 
: e Xj. 
Will therefore assign to each member of the class j Oe ЕН by “equation (1). 
ti ^ of class membership has in eflect gere of points on a con- 
Mum ^s А Series of classes can now be represented v icization of the qualitative 
уйще Presented by the x variable. It is this metri 


a ]scrimir ant function Oossibie. 
ship that m akes the linear discrim: fi poss. ble 
ination from this type of reasoning. 


Will y, ter derive i iscriminant fw I | an estimation 
cg D important to es that equation (1) is eSSenU minimizes the 
Im of s, dt gives a predicted score (x.)) for each individual W 

"qua i i i ; ху and 
D TI Ted dis ted score Xj. 
S. ме авон pred ардон correlation between the Prion of E predig 
Score ху wi io of the sta "s correlation 
Dr Feli Ше Hands ШЫ pup observed pac mu each class an the 
“ше, Tatio, ега, And we may recall that giving © ratio. (566 16, BB. 2 


GS О: 
“ore for that class maximizes the correlati И 


Linear and Non-Linear Discriminating Functions 


We shall now give the usual analysis of variance equations leading up to the F ratio. These 
equations will be used later in the derivation of the linear discriminant function. 


Substituting the least-square estimate for A; we can write from (1) that zj = xg — XJ- 


Sub- 


tracting the general mean X.. from both sides of equation (1) and substituting for 2, we get the 


standard form 


(xj — à. 


) = (@у—х..) + (xy – 5.) 


(2) 


The deviation of the observed score from the general mean equals the deviation of the Eum 
mean from the general mean plus the deviation of the observed score from the group man E. 
square each term for each individual and sum over the nj individuals in each group and then 
the c groups; we find the well-known important result that 


j=li= 


Š Š Qu —x):- 


S nth) + X &, (xy — &4* 


j=1 j=li= 


(3) 


; : yiance 
The deviance about the general mean equals the deviance between group means plus the devia 
within groups. The equation for the square of the correlation ratio can be written as 


R? 


deviance between groups 


total deviance 


The equation for the F ratio can be written as a function of R? 


F = 


which shows the basic relation between 


Derivation of the Linear Discriminant Functions as Canonical Variates. 
now use the above five equations to metricize qualities and derive a linear 


N-—c DRE 


е=Л ° І- Е” 
the two statistics. 


e 


(5 


We can 
function 


which will best predict the quantized attributes. The derivation has been е aa 
R. A. Fisher for the discrimination of two groups (11). An extension to mor 


two groups was suggested by Burt for psychological and education 


and (with a different line of ap 
C. К. Rao (23). Rao's somewha 


proach) has also been described i 
t brief derivation was given in term: 


Mahalanobis’s D?. Since the D? statistic is unfamiliar to most psychol 


derivation here (which is similar to that of Burt) will be in terms of maximi 


which will also maximize the F ra 


The F ratio (or its equivalent, the correlation ratio) tells us if class me 


tio. 


al purposes er 
in an article , 
sof maximizing 


ogists, the 
izing ^^? 
mbershiP 


variable. 


significantly differentiates the individual with respect to the quantitative d assign 


We will now invert the problem of predicting quality from quantity, e 
numbers to each category from the quantitative variables in such a way as to 


the F ratio. 


The question is put in this manner : Given q test scores for the 


jth class, how can we form the li 
maximum ? 


near combination of the q scores that 1 


Let y be the best linear function and let ijs Has хаз Шу be the best 


The notation will be : t represents the tests and runs from 1 to q3J represents t^e Г 
and runs from 1 to с; i represents the individuals and runs from 1; to 7; within 


re 
each x weighted by the appropriate 


In matrix notation : 


where yy is a column vector of N ele: 


ith person 1 
makes 


aximiZe 


the 


s. 
q group’ 


h 
: ss x scores V ү 
N= > nj. Then the y score for each individual will be the sum of all his 
и. 
Р 1 — (6) 
Уб = ИХ + UXeij +... xt + ...щхш = Ў ШХй. 
ix 
JN = Xiu, core тай 
Jy is ments, one for each person ; Xj is the nd vect mo 
of N individuals for q tests, whose general element is ху; and и is the colur nct! 


weights, one for each of the q tests, 
N is assumed to be greater than q. 
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Let us i А В 
for the E cs for yy in the analysis of variance equations and state the formula 
hus (y; y у ананан ratio, R*, in terms of Хи, and matrices derived from X;;. 
+) = (vy — Y) + Oz —у..) in matrix notation becomes 
Xu = Хи + Xu, (7) 
served өз N by q matrix whose general element is (ху — X), the deviation of the 
Dos an x = test г from the general mean. 
served score y q matrix whose general element is (xij — Xj), the deviation of the ob- 
A on test ¢ from the mean of the jth group. 
X; — X»), the deviation of the group 


is ^ 
mean fro an N by q matrix whose general element is (xij 
ie pos general mean. i 

set of weights defining the linear discriminating function. In equation (3) 


the su 

m of squa » ^ 
Broup means plus | ae deviance about the general mean was equal to the deviance between 
Notation : s the deviance within groups. Writing the same equation for y in matrix 


Where Y i 
ББ X is 


u'X'Xu = wX Xiu + u' Xj Хи . (8) 


Our notati а 
1. ر‎ will be simplified if we use the following definitions. 
Tow and the fih Cola, general” dispersion or codeviance matrix. The general element in the sth 
n 
Sus Ў y m 
Matrices a = (xj — З.) — Xe... It is a square symmetric matrix of order q, as are the two 
е 
ned below, W and B. The element gs = Nrs 9: 61 - 


ох 
^X X == Е 
W, the “ within groups " codeviance matrix. 
с "j 
w= X XQu— Fax — Xt)» 
j=1i=l 


3 X ^ 
ЗАХ = A 
2 = B, the “ between groups ” codeviance matrix. 
с 
Ьа = Ў т Xej— Xs.) tg — х..). 
Pees three g by #= iu E 
Xample G eee matrices are analogous to the single variable deviances in equation (3). 


"M Th 
Simi ® fo e . i 
imilar xq Ula for the square of the correlation ratio of the discriminating function, у, is 


9 equation (4) 
(9) 


ating function. we 
11 be to differentiate 
g equation for u. 


Th R? = u'/Bu|u' Gu 
Sh € formula u' Bu Я Mw 
Re 1 зеск ed for R? will now be used to derive the linear discrimine 
We vith respect ctor и which makes R? a maximum. The procedure will 
Obtain to u, set the derivative equal to zero and solve the resultin; 


dR? _ (иби) 2uB — (u'Bu) WG _ 0. (10) 


(9, Multi | du (и'Си)* ; : 
), simpli equation (10) by the scalar (w’Gu), substitute Re = “Вии Gu from equation 
> and the result is 
Stmulti | wB кис = w (B R26) = 0. (11) 
Plying by G-1 
(12) 


This, ; ‚ире T 
Sugee 18, in sli u (ВС: — К) = 0. 

tio; Beste ; Slightly different s the equation reached by, Burt 

fact, è n "da үз also be с паа ordinary expression for A is сог 

Bg lysis aay b [rc to the multidimensional case by mea X 

on (12), and t 3 ve seek is the laten! 

отат SOrresp Sd and the conditions of the problem, tell Hn Шар | bue by the SEA 

INE areful to pre-multiply by 

e used if post-multiplicattor by a 


The transposed mat inable 
VERE ‹ posed matr | obtainable. 
ctor is desired. The largest latent root 1 
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Equation (12) also implies that there are as many latent roots which are solutions as 
there are linearly independent rows or columns of B G-'. The number of independent 
linear discriminating functions is therefore equal to the rank of В G^. There are definite 
limits to the rank of B G7. С must be a non-singular matrix, i.e., must be of rank q, or 
else the inverse G-* will not exist. On the other hand, B has a maximum rank of c — 1. 
This is a result of the fact that the sum of any column of X; equals a constant, zero. This 
means that the maximum number of independent rows in X; is c — 1. So if q is greater 
than c — 1, the rank of B С! isc — 1. If c — 1 is greater thang, the rank of B G7! is q. 


While obviously the best single linear discriminating function is that one which is associated 
with the largest latent root, the set of all linear discriminating functions defines the number o 
dimensions which the groups occupy in g dimensional space. 2 ich 

Multivariate linear discriminating analysis can be likened to a form of factor analysis whit 
extracts only those factors which help to distinguish groups from one another. J therefore sugges 
using the phrase “canonical variates ” to denote linear discriminating functions extracted in He 
manner. Any linear combination of scores on which the group means tend to be equal will s 
discarded automatically. This raises the possibility of experimental verification of postulates 
factors. If a factor can be defined in terms of groups of homogeneous individuals, then the, а 
criminating function which maximizes the discrimination between these groups satisfies the definitio 
of the postulated factor. 1 U. 

The set of latent vectors associated with B G^! will form a q by r matrix which we can al to 
U cannot be compared directly with the usual factor analysis matrix F. F is usually use set 
denote the set of correlations of q tests with r uncorrelated factors. It is therefore the 
of regressions of the tests on the factors. U is the set of regressions of the r discrimina 
functions on the q tests when the standard deviations of the tests have not been made equal to s 
F can be turned into a matrix of regressions of factors on tests by one of several formulz. sio 
is one such formula (where R is the q by q correlation matrix). When multiplied by the appropri e 
diagonal matrix of standard deviations, these factor regression matrices will be directly compara ts 
to U. Tests of significance for hypothetical discriminating functions versus observed discriminan 
have been discussed by Rao (24) and Fisher (14). But special procedures will have to be wor 
out for verifying factors found in previous factor analysis studies by comparing U with F. ., ting 

It is only permissible to define a factor operationally in terms of a linear disci a 
function, when that factor has itself been defined in terms of a discrimination between Ec are 
H. J. Eysenck, however, in his studies of the * dimensions of personality ' speaks of factors. Wisorder: 
defined in terms of their power to differentiate between various diagnostic types of mental. 1 using 
In a later article I hope to explain my view more fully by means of illustrative examp es, 
(amongst other material) data drawn from Dr. Eysenck's own work. her à set 

Multivariate discriminating analysis can also be used to answer the question of whet n E the 
of categories really form a single linear continuum. If the rank of U is greater than one, t eating 
groups cannot be represented as points or intervals along a single scale. The best linear discrimIn. les 
function would still not suffice to represent the differences between the groups. For схава 
discriminating function analysis could be used to test if political groups such as Liberal, Conserva 
and Socialist actually fall on a single dimension of Radicalism-Conservatism. . ting functions 

When the covariance matrices of all groups are assumed equal, the best discriminating Ad an 
are all linear in form. A full discussion of this case with a worked example is given by ** 
Slater (25). А further discussion of this point will be found in Section II (b) of this paper. Л 

DT DANS š m— а axl 

The Prediction of Qualitative Attributes from Quantitative Variables by the MT 


mum Likelihood Discriminating Function. The underlying weakness in the single 


approach to discriminating functions is that unless the categories lie on “tifyind 
dimension they cannot be adequately discriminated from one another. (oy sing 
o 


the categories may force an actual multidimensional set of qualities on t rea 
straight line. The procedure may really be distorting the data and obscuring 
differences between the categories. ids this 
The maximum likelihood discriminating function developed by Rao avo! : сеп 
difficulty (23). In this method, the pattern of scores is used to discriminate ү cores 
groups of individuals without the step of condensing each individual's set O = th 
into a point on a linear continuum. This basically different approach release уе 
psychologist from the necessity of forcing quantification on essentially ача! 
data. ук 


А full discussion of the mathematical basis for the method will be found in the parer ih (27). 


More elementary discussions will be found in the papers by Welch (29) and С. А. В: 
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Smith applied the maximum likelihood method to the problem of discriminating two groups. His 
olution was a likelihood ratio which, interestingly enough, simplifies to Fisher's linear discriminating 
pen When the variance-covariance matrices of the two groups are equa. — И 
oe Еду stated, the method consists of calculating the likelihood for each individual a respect 
ES aries The individual is then assigned to the group for which he has the maximum likelihood. 
Smith’ ull method would lead to as many likelihood scores for each individual as Ше were OD 
grou 3 Solution was to calculate the likelihood ratio, the ratio of the likelihood of belonging oane 
asing to the likelihood of belonging to the other group. When there are only two КО: mi d 
Sons escore. [tis possible to examine the logical basis of the maximum likelihood so un hout 
NU deeply into the mathematics of the problem. Smith gives a simple explanation i о: 
Шар due. In the following we shall consider the simplest case that will Ше е E Шорц 
at th 'Scriminating method, namely, the case of three groups. Let us, however, begin by g 
€ univariate or linear case. и 


Given thre B. and Group C, who have taken 
à si € groups of people, Group A, Group B, a А ү 
of isle test Y, how can we allocate each individual to a group, merely from а knowledge 


15 Score On X, so as to minimize the percentage of misclassifications ? The logical structure 


Of thi d iscla ons 
this question is identical with the problem of quantitative prediction. A set of mutually 


IET : categories i Ori a quantitative 
Vari groups or сг Ue a alitative variable just as a set of scores 15 
lable, gories is a qualitative va iab iati З 


locatior i i ivale rediction 
e n to a group is equivalent to pr ‹ ‚4 qui ч 
importance’ of minimizing the ролине -of misclassified eae phe 
“nee and pl " he least squares co ecce С) 
Squareq plays the same role as the lea enl li t 
ien ; inear discriminating 
function ae of prediction. (It will be recalled that the canonical linea 


Scribed in Section I was derived from least square considerations.) 


"AE dep" individuals possessing 
Scores тар Of the three groups is given in Fig. 1. Ху, Xs and xa are individuals p 


G 
Indivig n X at the point indicated. The ordinate Lj represents the диш dee 
lu ore that may be expected to have the score xı Similarly, the ordin a1 
“sent the number in Groups A and B that will have the score Xı. 


ents 
respond tot take any specific individual who has the score xy and say that kc TR d 
e тетра to the probabilities that he has come from these various groups. uu edt any individual 
longs t yee of cach individual at the beginning of our problem, the probability that a 
rer the S вт is ei r unity. j e three 
эшн б aeaa iz ehte дно P inira y p. A. Fater may р ша Мр ie E 
Е Present the three likelihoods that a person with score ¥ WI ^s ig tO ais assigned (0 
oup fo 145 demonstrated the important theorem that if е eee a minimum. Th 
Or Which his likelihood is greatest, the number of misclassi s ber Шке СЕ Б 
aving the score x, would be assigned to Group vere equal numbers іп сас 
UD it iş 9 be assigned to Group Band Group C respectively. 1 there ifications. If the number 
nt People i Obvious that this NOU lead to a minimum number of misce arana S aey ЖЕДЕЛ} 
foist be wei, Ве Universe differs from the sampled number in the group, ШО his universe. In what 
Ооуу, it wae у an amount proportional to the number in the Eo Ын аге equal. 
tha, I wil © assumed that the proportions of the three groups іп the анас AR 
an др o © Noted that there is an interval, a, along the a ane Econ to Group B 
aly Sater tha Similarly there is an interval b where the euer d ойе likelihood is 
CH i C, 
f ۹ es Ta, other likelihoods, And for all ap such regions exist and can always 
Sund if Or Group C. Rao’s theorems state th E. | h 
qua) OW the © ordinate for each score in each group 15 here the likelihood functions агі 
al, е yy Point which separates the regions is the point w er the distribution function n 
A, thee Point where the curves intersect. If we knew th ate misclassified cou 
асц" the number of individuals from Group 4 чор ie mal of La over the 
‘val $, ^e. The number misclassified as Group B is equa assigned to Group C і5 te 
not „Under pp Milarly the number of Group 4 individuals mi noted that as yet We 
тите 2 4 frequency curve for the interval €. , It wi Love statements а 
atter Ww AY specific type of distribution curve ; 


re true 
is gi Bu Огт the distribution function takes. E 
1 rt stribution А e two 
Biven by m Shown (see Appendix, p. 104), the point X wher 
rmula 


al curves intersect 


7 Я N10? | 
Ce (бл, — сх) + сусу (= "m in) + ЕЯ 
Га © 
E 95 


Linear and Non-Linear Discriminating Functions 


— | 


X; X X3 
a b C 


Ез 1. Discrimination: the one-dimensional problem. 


Fic. 2. Discrimination : the two-dimensional problem. 
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where X= 
I = th Е ; ES 
e raw score at the point of intersection, X, =the mean of the first group, 


X= the mea 
9, = the а E. second group, o, = the standard deviation of the first group, 
ES inberin non of the second group, Ni= the number in the first group, 
Point is midway bety le second group. When the N's and o’s are equal the intersection 
» Seems to have bi ween the two means. Burt, who first obtained the above equation 
or cation of individuats os to propose the point of intersection as the cut-off point 
et u viduals into groups. 
graph with al pa the bivariate case. We can plot ea 
plane of the yas the axes. If we think of frequency as 
the relative fre graph, then each group resembles a hill. 
group. Fig, н су with which a particular pair of scores 
is , I2. Fig. 2. a rough schematic representation of the situation. 
gcobvious that TOME E have bzen used to indicate height, i.e., the frequency element. It 
Dub an the Ks ше where all individuals have a greater likelihood of belonging to one 
ше ас that they Moa EI methods of obtaining these regions (23). For our 
Y e ent. 
поса d CE eee aaa it would be necessary to compute the three likelihoods for each pair of 
that eens resultin de to the group where his likelihood is greatest. The number of mis- 
uld arise one rom this procedure is always equal to or smaller than the misclassifications 
person e same Procedu only one variable. 5 
Àllocat; the three likelih can easily be extended to the case of more than two variables. For each 
Of gr ion would be VA (one for each group) would be calculated for each set of scores. 
he group with the highest likelihood. It is clear that increasing the number 
The maximum 


ch person on a two-dimensional 
a third dimension orthogonal to 
The height of the hill represents 
хуу Хау», Us is found in the 


like; PS merely i 
ihood Miers the number of likelihoods that must be calculate р 
ating technique is, then, applicable to any number of groups. Increasing the 
i d functions but does not 


Dumber o 


incre; quantitative vari 
ase tpo Antitative variables increases the complexity of the likeliho» 


B € num To de 
in is Teal difficulty of likelihoods that must be computed. UND 
Sroup i у arises in the actual calculation of the likelihoods. Тһе number of individuals 
uld be determined for grouped values 0 


One уар p OP is, of i 
fre, table, but there finite. Empirical frequencies со T 
Won cies of Ene task would Eod become T apossible if we attempted to determine the relative 
the p, demand an j Pair or triplet of scores. To actually determine these proportions empirically 
Tediction of à ! possibly large number of people in the sample. This same difficulty arises in 
qualitative attributes from other qualitative variables. It is discussed in Section ш. 
rectang withi be an assumption about the theoretical nature of the 
M БО In each group. Special considerations may lead to the selection ofa binomial, a 
best representing the shape of the distribution 

function will prove 


n, or other frequency distribution as thes 
the normal multivariate 


the O use 

со at А 

anq stants уреогенсаї frequency distribution it will be necessary to calculate, 

quand еа e the function. In the case of a normal univariate distrib j 

р; Mon ions are sufficient. The score of any individual may then be sub 
dual will have as many likelihoods а: 


Or each 
group. Therefore each indivi 
calculation of all means, 


tion f ane, 

Сопу eviations, Ве айе normal distribution УШ їз о 
= ati i ас! 

degne . The likeli inter-correlations. The inverse os ea 1 

е Беу К Сеп likelihood function for each person will involve a generalized q 


ts a т 

} 5 are the elements of the inverted covariance 

рагу aS hi etail by C, A nts of the inverted CO 

y is d. Оу C. A. B. Smith (27). Since Smith is 

di thm of ihe iminant function de of the two likelihoods, Ог, rather, ме dee 

КШК eio Road ratio. It will be recalled that for t i 1 

simpli ik argest likelihood. Even so the keiten func. or 

i ad o ] 

minat bles which pd 
tion CO! А 

0 : dekh the optimum set of predicto rm ut 

E апарате is unworkable for а large number of variables E E а d 

E ES ee Lo Penrose od as to be linearly independent 


с 
The condensed set of variables 5 


ең 
al 
t of fd Sch 
in „егу tolastic Те, M obl 
еси 515 (1921), p. 165. conditions of the P 
сер ПОП [Pee qe 1), p oe me ondi he sign before the radical to PeP impossible: 
ere dud з the relative size of di and сз OF of № and Ns Bt logarithm of, = 
below, I use * In" as an abbreviation 1° 
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since the multivariate normal distribution requires the inverse of the correlation matrix. This 
inverse will not exist if the variables are not linearly independent of one another. 


(a) The Non-Linear Discriminating Function for the Case of a Multivariate Normal 
Distribution. The multivariate normal distribution is of special interest because so many 
psychological variables are distributed normally. When the variables used are themselves 
the sum of a large number of variables, it is very likely that these sums are distribute! 
approximately as the normal curve. This follows from the central limit theorem (9, p. 232). 
It is possible in the psychological field to construct variables that closely approximate а 
normal distribution of quantitative scores by obtaining the sum of a large number of repeated 
measures of the same trait. The discriminating equation will accordingly be derive 
assuming that the distribution for each category or group is multivariate normal. 

A matrix notation will conveniently be used. To distinguish vectors from scalars, the letters 
denoting the variable elements will be underlined when they represent vectors. Let Cj=the q X d 
covariance matrix of the jth group, x:=the 1 by q row vector of observed raw scores for the it 
individual, dj x; — ту== (ће 1 by q row vector of the q scores expressed as deviations from the 4 
means of the jth group ; dix = xi — ть, the same raw score vector expressed as a deviation from the 
vector of means for the kth group. And let 1,==the number of individuals in the jth group > 


[4 А 
N= X ^j; p= n/N ; and fixi) = the frequency of the set of observed raw scores x: in group J 
ja x 


There will be c discriminating functions, one for each group. The standard operating procedure 
would be to calculate fj (хт) for all the c groups and then assign all individuals having the pattern 
Scores, xi, to the group where the frequency of x; is highest. This will minimize the total number 0 
misclassifications. = à 

The equation for the frequency of x;, assuming a multivariate normal distribution, is + 


f = сауат PI} go C dn. а» 
Similarly the frequency for x; in the kth group would be : 
AG) = ya aT P~} de Code. (9 


е 
Thus the frequency of any pattern of scores, xi, can be calculated for each of the с groups. x 
can simplify equation (13) somewhat. The term (2z)"* is a constant for all calculated frequen 


for all observed values of Xi, and does not need to be considered. 


Let Ly = + 2 In [fj (x)] + 4 In 27. (15) 
Then Ly = 2 In() — In | Cj | — dy Ci?)dy . (16) К 
Ly is a quadratic expression which can be used in place of the frequency. When the frequency 


. Б RSI SP ok REB it 
xi is near zero, Li; will tend to minus infinity. When the frequency of xı is unity or greater than unity» 


Ly will be positive. umber 
, There is still another form of the discriminating function that is useful when the absolute П 
of individuals in each group is not known, but the relative proportions are given. 


Let dy = 2 In [49] + gqin2z. a7) 

Then =2 In py — In | C | — dy Gdy. (18) 

Where ру = nN. (19) 
^ will vary from minus infinity to the maximum positive quantity of (g In 27). d pilots 
Thus a personnel officer may not know what total number of bombardiers, navigators, Ni Force 


will be required. If, however, he knows what the relative proportions should be in the А! chools 

as a whole, he can use 2. This will allocate all available individuals to air crew training 5 

with the smallest number of misclassifications. mal, 
We have thus shown that, when the distribution within each group is multivariate nor for 

the best discriminating formula for each group is a quadratic expression of the 4 sco 

each individual. 


spi bution 
(b) The Linear Discriminating Function for the Case of a Multivariate Normal Dist! ibu 
Let us assume that Cj-06 -...—C. (20) 
Then ^j = 2Inp; —In |C| (xi mj) C7 (xi — my. QD 
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The quantity (—In | C | ) will be constant for all groups. 
xy +In| C| —21np; = Xi C-'x + 2xi Co тў — mjC^ тў. Q2) 
In equation (22) it can be seen that the quantity —(Qi C-'xj') is a constant and does not vary 
ps the с groups. It is now possible to define a new term, lij, which will be directly related 
0 the likelihood of x: in group j ; but will involve only linear functions of xi 
lj = 305 + In| C | + xi C7 xi) = хі Co mj + In pj + уту C7 тј . (23) 
Equation (23) states that li; is calculated for any x; by applying the q weights given by 
m; and adding the constant term, In ру — àmj C my. When there are only two 
frons, li, can be shown to be directly proportional to the point-biserial multiple regression 
a nction, or to Fisher’s linear discriminating function. Rao and Slater give a worked 
ВБ for three variables and six groups (25) іп which they use a function called Li. In 
IS article /, = L; . 
k Summary. Section II (a) shows that for a multivariate normal distribution in 
ach group, and unequal covariance matrices, a quadratic function of the observed 
S is the best discriminating function. Two such quadratic functions have been 
p Li; and àj, where 
1 و ر‎ Gp + q In 27 = 2 In nj — In [6 | — dj Cj 
n “2 


Жы а AM omm ; 

j = 2 In [409] 4g inde олар —n 1G] СГА 09 

pres j h of the c 
› be used where the number of subjects, у, to be allocated to each o 

to apsis known, Aj is for use when only the percentage of subjects, pj, to be allocated 


9 each of : 
f the c groups is known. NE. 
co Section II (b) shows that, for a multivariate normal distribution in each group 25 
nstant covariance matrix, C, a linear function lij» of the observed scores, is the 


est discriminating finetan. 
2 tj 21 52] +qin2n + In \с| +x CoP x 


1 


dj 04) 


ог =з» C-* mj + In pj — ът; С 1m; " (26) Ж 
às define sideration of the equations for Lu, Jun ай ft will show ether a that elass bed as à 
ante aye lee m elo ary as the quotas for these jobs vary- Ifan 


T Or a pilot b 1 int 

Tejec, Dt Criterj t, by a selection officer, 15 KC). ^. . ie quotas А =) 

wisction fron ОП of class membership із wanted, it will я Ls ay ost aco nee Ie 
tine below this critical value wi 


from, the? reser cach group, Ly or the other two equati t 
M the grows! dence lling belo 


this confidence level. All individuals fa 


A 


sing test scores 
lem of using е5 many 


1 ntl А 
iff Y, th 
otpa Йегет here has ч + t and researc h moro. a 
ale have A diagnosis of mental disorders. Rapaport Об), абы, (18), Rabi ( D smote o 
Soreg $ Patients 09180 the use of certain difference que ишо [идей tha 
:ceniminating functi it 
triate T :ch the discriminating | ds is mu 
Ariat UT preyio 15 is the very type of problem for which th a distribution о Еа) ne 


p 
Tati, no us dis А hat whenever s 
S Im cussions it should be clear thal е О te norma 1 
the vapore ien or can be satisfactoril approximated by а multivar оге can be effective ony 
n ily appr A difference maal discrimination wil 


i zn a i - " + 
цене ROSS and 22 ріпа! method of discrimination. ien 


a n 
sing covariances are constant over the groups. 
a quadratic function of the scores 


11 
ATT Rip Тив PREDICTI es 
the In Tab UTES FROM QUALITA vag were tabulated, 
Westio et above, where various types OF P'S igs was 
Of predicting qualities from other qualities 
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solution. Actually many statistical techniques have been devised which deal with 
contingency tables, the basic data for this type of prediction problem. But all o 
these techniques test the deviation from a chance distribution, or measure the amount 
of association in a manner comparable to the correlation coefficient. What is needed 
is a workable solution to the question of how to predict class membership from 
knowledge of membership in a set of other classes. 


Suppose a population of N individuals is divided into categories Ау, As, . . . Aj · + Ap 
which we wish to predict. These will be called the criterion categories. Suppose we are 
given the further information that each person has a secondary classification in CHEST 
Bı, Bs, .. . Bi, . . . By which are to be used to predict the criterion categories. These WI 
be called the predictor categories. 

We can now make out a q x p matrix whose general element is лу, the number of 
people who are in both the row B; and the column Aj, The principle of maximum likelihooc 
gives a direct simple solution for the best prediction of 4 from B. For any specified row ЛА 
find the л which is the largest in the row of values, Tii, Nias a = ia Hi, eee Т» If nis 1 ы 
largest frequency in the B; row, then the best prediction from В; isto Aı. Thus a particu a 
A category will be predicted from each B category. More than one B category will Seed 
times predict the same A category. There is no necessity for each B category to be lin! T 
with a different A category. It is easy to see that this method of taking the A category 
which has the highest frequency in a particular row and making it the prediction for t E 
B row results in a set of predictions that minimizes the total number of misclassifications- 
This is the simple situation that prevails in the case of the single predictor variable. á 

In the case of two predictor variables, we are given the additional information that t а 
individual has a tertiary classification on categories Qi Gir Oh vus Gre “Ths leads to а 
three-dimensional table of q х p х r, whose general element is пиу , the number of pea 
who have a joint membership in Cs, B;, and the criterion category Aj. The E n 
likelihood method for the case of two predictors is formally the same as the case of predict! 


s Ў ГАР i as 
from B alone. For each combination Cr, Bi, find the prediction category Aj wish d 
the largest frequency. When this category 4; is taken to be the prediction for an pru 


joint membership in C; and B; , again a minimum number of misclassifications wi 


But the number of possible combinations of B and C is qr. Asthe number of discri 
tors is increased arithmetically, the number of combinations increases geometrically. ched 
any sample is finite, as the number of predictors is increased, a point will soon be тей 
where the predicted А category from a particular subclass of the discriminators W! 
indeterminate. There will be no individuals who fall into this particular subclass. 


Another difficulty is that the predictions must be established by a detailed investigate can 
empirical frequencies in each case. There seems to be no method whereby a set of pan OH of A- 
be calculated which, when applied to the observed set of predictors, will give the best predic "that the 

Each contingency table could be first tested by the usual chi-square technique to ue can be 
observed frequencies are unlikely to have arisen by chance. Coefficients of association | is the 
computed for contingency tables. But the best measure, in view of the method we аге 05108; 
percentage of incorrectly classified individuals. Research 

R. A. Fisher has presented an interesting technique in his Statistical Methods for cores 210 
Workers (15) for quantizing the qualitative variates in a two-way q x p contingency table. , п betwee? 


ion of 


assigned to categories A and B in such a way that the Pearson product-moment correlatio: pvious: 
the quantized variates is a maximum. The extension to the multivariate case does not seem t each 
The basic difficulty with this approach is that it assumes exactly what we wish to avold—"™ 
category is a point on a linear continuum. esenta” 
On the whole, the solutions presented here seem inadequate to the writer. Perhaps the PIA cing 
tion of the problem in this Journal will focus attention on the question and reveal that some "^ pan 


technique can be used.! Most responses to situations are easier to classify in a qualitative т. 
a quantitative manner; and the psychologist would find a great deal of use for a techn 


sable, 
* Professor Burt suggests treating each composite class as a * determinate’ for a new ' determi the 
and then using the method of maximum probability, if the values are independent, and, ! d ard 
usual weighting formula (w'R™) : see Factor Analysis of Qualitative Variables. In theory S < multi" 
probability calculus can always be used. Possibly a model could be devised which allowed ents. 
variate multinomial’ distribution to be employed, deduced from its first and second mo™ 
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IV. 
A NON-PARAMETRIC TEST OF SIGNIFICANCE 


All discriminati i 
equation or inating functions, no matter whether they are based о 1 
Observed D judgment, can be summarized in a two-way o ease 
Using ane е: Акы aie the mand. кошу to fall) against predicted category В. 
KOws В, to B, wi П, the columns 4, to A, will represent the observ jest 
in B, Would ui Ma be ше predicted classes such that if discrimination were eere e guns 
p perfect A oe cm om ue саваа ‘Ay, everyone placed іп Ba would fall in As, etc. 
th $ 1 square contingency table wo i filled, 
the off-diagonal entries ud e d y ta uld have the diagonal cells filled, 
total of the sd in the ith row and the jth column be denoted by mj. Let the marginal 
Principle adjustable, e B; and the marginal total of the jth column be 4j. Since Bi is in 
ме shall noi “к eR can always, theoretically, make B; equal to 4i . In this discussion 
ot necessarily Su simplification. We shall treat the general case when A; and B; are 
ations occur by chance 


Wec 
ап now de dee ЖЕ 1% >. 
define a pure chance discrimination where correct classific: 


alone, 
For a pure chance discrimination 
AiB & 
m= ay , where N = pa bi n. Q7) 


Our real i ee 
55 Ту 18) in the diagonals of the square contingency table. The misclassifications in 
one is is ае be occurring in a manner contrary toa hypothesis of chance distribution. 
rans classifications except for theoretical analysis of the mistakes. The point is whether the 
RENE therefore not b ae occurring more frequently than chance would allow. A chi-square test 
nificantly greater " exactly appropriate. A contingency table in which the diagonal entries were 
ater than chance expectation would necessarily have a significant chi-square. But 


a signi 
ificant chi. 
ant chi-square i: з агг у dai р 
square is not sufficient to show that the discriminating function has made a greater 
hance. However, the chi-square test 


er of b 
КШ always, der classifications than would be expected by cl 
etter than Borde and if it is not significant the discriminating function on the whole does not 
If w m 
Tt е are j $ i TOM 
&roups, then e interested in the total number pf correct classifications over all observed 
oe must test the difference between the (W0 quantities. 
= 2 nis E 
(Сй = number of persons observed to be correctly classified ; 
y chance. 


Ew 
pss dn 
i classifications expected b; 


ic, N = theoretical number of correct 
who are correctly 


Th 
Class: €n, th 
ified 4. € null h ; 
ypoth: i р 
pothesis will be that the number ene correctly ДЕТИ if the 


assi 

gnme OSS not di ) 

atr Ment differ signi 

a to ignificantly from the number who orf 

als; om rom (Categories were. based on chance alone : thatis, if Bı individuals were picke! 

9 be in А е total group of N individuals, the number of such Bı individuals who WI 
à ategory A, will be AB, 

umpa cases Whe NC E 

меге o 2 it is at the number of correctly classified individuals, 0, bie than oa hose cases 
T essa isni re: n 
€ o c4 Seater tp, Ssary to test for significance. We are ony inte pling from a population 

tail test 1 ortion of the curve 


fi а а e 
оо ak ап е. "This is essentially a опе- 
e on 


e а 
em Values from zero to N. But we ar e We 
^ е‹ 
small values of N the binomial (s + ме) should be evalua i 
N! { e M NES 
wind iN 


fsi 
ign; 
Bnificance С 
to the Would then b | | 
1 e to sum all terms ofthe binomial, from 5 29 Woy 
| the number of correct classifications 
denotes the 


ast 

Obta; ter; (s ү 

а m p bility tha 

i А ai 
hap ted y Nf + Thissum would give the probability thet that 

Ne, y cl 1 he as umption . tribution 
“XPectat Се would reach or exceed the number 0, s аве the binomial И 

> 


z lon ^ 
in the general population.* When N i: 


map a di 

Sap Ca] „ЭС с ith the mathe- 

тру, Pasig LO" of thi К ferences dealing wit ble 

Ў Wher а Burt, The och йе a ЕУ "m Hom work and ergometric formula from dou! 
war hild, рр. alio describe di 


© Conven; 
уе 
nient methods of calculation аг 
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approximately symmetric. If the probability distribution is symmetric, the usual ż ratio test for the 
(o—e) VN 
Wah 
ve(N — е) Е : 

The 99 per cent. confidence interval and the 95 per cent. confidence interval of the binomial 
distribution have been presented by C. J. Clopper and E. S. Pearson (7). Tables of these 
are given by Snedecor (28, pages 4—5). =: 

Even though the observed total number of correct classifications, 0, may be significantly seat 
than chance, it may be suspected that not all predictions are being made with the same accura 


difference between two percentages can be applied, viz., t = 


The ли for each B; can be tested against the e; (= A in the same manner as the overall test of 

oande. Неге the base number would be the number in the category Bi, rather than N. | ected 
The usual ¢ ratio test for the difference between two proportions can be made if it is sispa ES 

that category В; is a better discriminator than B; However, note that лш must be divided by Вга 

ny by В) to give the correct Proportions for the test. :minating function 
In this way, fairly precise tests may be made of the effectiveness of any discriminating trices, 

without having to make any assumptions as to normality of distribution, equal covariance ma 


be 
etc. Johnson (19) gives a practical procedure for determining when the normal mnis por 
expected to represent the binomial : (1) compute 9N/(N + 9) ; (2) if e» 9N/(N + 9) use the 
curve. 
ating 


Occasionally we may wish to compare the efficiency of two different discrimine 
functions on the same data. This leads us to a two by two table. The two rows been 
represent the individuals (1) who have been correctly classified, and (2) who have ilarly 
incorrectly classified by the first discriminating function. The two columns would si 
split the groups into correct and incorrect classifications made by the second discrimn: both 
function. The four celle would represent (a) those individuals correctly classified s by 
functions ; (5) those individuals correctly classified by the first function, but incorrec rectly 
the second ; (c) those individuals correctly classified by the first discriminant and cor 
by the second ; (d) those individuals incorrectly classified by both functions. ion of 

With this fourfold table in mind, we may now ask whether pı, the propor on of 
individuals correctly classified by the first discriminant, is greater than p», the propor cases, 
correct classifications by the second discriminant. For a large enough number © 
an appropriate test of significance would be 


(p —p) № а 
V Pis + рз —2rys( Pid ips)! int fourfold 
where N is the total number of individuals to be classified and Руз is the poin 4 may Be 
correlation coefficient. When N is large enough, c will be normally distributed an 
evaluated at the usual levels of significance tabled for the normal curve. 


= с; 


У. 50 RY 
MMA os from 


This paper has considered the problems of predicting qualitative attribut whic 
(1) quantitative variables and (2) qualitative variables. The canonical variate been 
define the minimum subspace within which a set of group means may lie 
derived from the basic equations used in analysis of variance. itative 
The maximum likelihood discriminating functions for the prediction of qu Explicit 
attributes from quantitative variables have been described and explained. 
formule have been given for the case of a multivariate normal distribution 
unequal covariance matrices and (2) equal covariance matrices. dictinE 
A maximum likelihood solution is proposed for the problem of Pre 
qualitative attributes from qualitative variables. ich 
Some non-parametric tests of significance have been suggested whi 
applicable to all discriminating functions. 
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APPENDIX 


On the Discrimination between Members of Two Groups (Note by C. Burt). Let ПО 

be two groups whose means (x, and x), standard deviations (c, and ca), and numbers (N; and m 
are known, and whose frequency-distributions are normal but overlap. It is required to find a poin' 
on the abscissa, x; say, which will best serve to discriminate between members of the two groupis 
To fix our ideas, we may suppose that the first and larger group (denoted by a subscript 1) Co) 
of all the pupils in the ordinary elementary Schools, and that the second (denoted by a subscript ч 
consists of all those who have been diagnosed as mentally defective by the school medical p 
and transferred to special schools: as tested by the re-standardized Binet scale in Daba » 
distribution of the two groups was approximately normal. Our object will then be to find a borde 
line, in terms of an I.Q., which will minimize the amount of misclassification. hl 

The number of children misclassified will be represented by the sum of two areas, MIL EU 
triangular, namely, the upper tail of the left-hand curve added to the lower tail of the right-ha 
curve. Thus the sum to be minimized will be 


+o z re ч 
АЙЕ М, E SS ЕЕ Tn, say: 

ls а S Vm ё dx — | Ф (х) dx + | Фа (х) dx = f + 

20 = ЁЛ —=© 


7 ата A А АРНЕ г, and 
To find the value of x which will minimize this sum, we must differentiate it with respect to x, & 
Set the first derivative equal to zero. The first derivative will be 


; - Bc. 1 imit, this 
By the usual rules for differentiating a definite integral with respect to a lower or an upper limit, 


expression = — о, (xi) + o. (x). On Setting this equal to zero, we obtain фу (xi) = 9s (х) : 
j с (x —%,)? А 
that is, e ent = сЕ 
01 ۷V2 02۷27 have 
But these expressions also represent the ordinates of the curves 1 and 2 at the point xr. pret 
thus shown that the required value for xi is the value that marks the point on the base-line a turd 


the two means, at which the ordinate has the same height for both curves ; and this ordinate in tive 
marks the point where the two curves intersect. Hence, as might have been guessed a 
grounds, the best way of discriminating between the two distributions is to cut betwee! 
overlapping curves at the point where the trough between them is lowest. rves, We | 
To express x; as an explicit numerical function of the constants describing the two EOS text | 
have merely to take logarithms and solve for x; We thus obtain the formula E an LQ. 0 
(p. 95 above, and Mental and Scholastic Tests, 1921, p. 165). It was on this basis 2 the ргоо! 
70 was suggested as the borderline for the mentally defective. It should be noted t 6 be used, 
assumes that we already know the proportion of N, to Ns; and the known values mS samples 
: y ication OF ©, 
selected by testing. Since the two triangular tails are not exactly equal, the application ^ ist's 
+ $ MIC 2 2 ocedur А 
instructions require him to fill exactly М» vacant places in the Special schools, then ano Рагер 
would have to be used, viz., Selecting the dullest М, on the LQ. scale (see loc. cit., р. classified on Р 


such а: 
Ые быс 15 


1 The following extract from (4) has been appended at Mr. Lubin's request. 
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THE INFLUENCE OF DIFFERENTIAL WEIGHTING 


By CYRIL BURT 


b Tie Problem of Weighting, I, The Effects of Different Types of Weighting. 
ү. Suited Sums as Factors, IV, Experimental Studies of Differential Weighting. 
mary and Conclusions. 


I. THE PROBLEM OF WEIGHTING 


СОД _in Psychology. In considering how to combine the marks obtained 
Comprising a Mation that includes several question-papers, ог from a test battery 
different; i a number of subtests, one of the most puzzling problems is that of 

ae Weighting. Is anything to be gained by assigning different weights to the 
of their i lests or papers, and if so what are the best methods to adopt ? In spite 
Involved PParent importance, surprisingly little attention has been given to the issues 
to be ar hat Freeman wrote in 1926 is even truer to-day : 2 Weighting has come 
practi ess commonly used than it was a number of years ago " ! ; and most 
Sychologists who comment on the matter are content to quo 
үт " weighting is not worth the trouble "—a statement, I fancy, which 


to apply not so much to the tests in a battery as to the items in à 


In the; 

later "their book on The Scienti ional P 
icati te 5 Study of Educational Fro 

a brie оп that eg es the E» problem—Monroe and E 
Атеш Y reviewed the relevant literature. They point out that the majority 
| <“ weighting does not affect 
A at the conclusion generally weighting 5 
1 rightly Say, 
the ty to any marked degree.” ? On the other hand, as they PRT s 


attent; СС Of weighti ee. 
ntion » eighting on the validity of the measures has PP ica] questions 


that 11 1 u 

а », P-, 190). before turning t ٤ р 

а tical mete be ed help those e are, од i e 
€ situația 1095 than in technical discussions if T begin ert 


Ons 

exam Practi x. Which such problems have come 
i cal I ч : 

Pupils tions portance. The earliest instances 2 

) y е 1 журе ына ers were accustome 

bes de а pe system tta terminal examinations 1n all the 


\ Year оп the basis of a composite mark 0 ы 
„ап 


i lass, H. К., 
5, 192 " ‘ons he quotes Doug 159» 23, 
DOT 26, p. 275. In support of his own conclusio m DEUS, Рой, XIV, 1973, 


9- , E 5 М 
of 12 Is it Necessary to Weight Exercises in Stand: l Ten тер Corel, $. ian 
Educ. Psych, XA > aod "Unweighted 


Pores Cightir later disces | 
от ting ater discussions, coming to muc! 

Ite, Of, Exercises j » 8 og t 
the Epid, ХХ Barrett, a T ет Marks Base! on Weighted fhich Determine 
А cti › 1932, рр. 92-8, q Nofisinger, F- R» 

Буу °° ‚рр. 92-8, Scates, D. E., an 

"n homei SE Weighting, i XIV, 1933, pp. 280-5. i 

| à ARN тето 1ай d chiefly 
n o › 1936, p. 448. ares, weighting is SA that in the 


Со 0 м 
the neg Dt arise: : ast squ ve ‹ 
NE one wi = x itr facts Ши їп сеп of least sq reliability | 2, 5) the observations 
rut ite xophysical measurement, it is again the differences іп theres 176, and refs-). 
Bist 5 Doi orm of weiehti Д T duced 
ls nte ` weighting to be introdu! © 
Compo d out in the memorandum cited below, the problem: 


Ss ble 
*pend far more on validity, ana fed Е by Edgewor 


The Influence of Differential Weighting 


subjects of the curriculum ; certain pupils were not infrequently recommended for a backward class 
or for possible certification as mentally defective on the ground of low marks ; other pupils were 
awarded scholarships on the results of a composite examination in a prescribed set of subjects. 
Discussions continually arose as to the relative weight to be attached to the different subjects : should 
not Arithmetic be given a higher weight than English, and should not both receive a far higher weight 
than Manual work? In such cases, it is to be noted, the interest centres on the single composite 
mark, intended to represent the ‘highest common denominator’ or ‘factor *; and this factor 
here relates to a more or less closed and restricted field —namely, the fixed group of subjects forming 
the elementary syllabus. 3 
Similar questions arose for discussion during the investigations of the International Institute 
Examinations Enquiry. In a memorandum which I was asked to prepare, an attempt was та e 
to derive certain general formule which would indicate the effects of equal and of differential weighting 
on reliability and validity ; a factorial procedure was suggested ; and arithmetical illustrations were 
given comparing the results obtained by different methods.! The main conclusion emerging wee 
that the chief danger lay, not so much in the neglect of carefully calculated differential weights, ИШ 
in the unrealized presence of random or unintentional weighting— particularly that due to асс! enta 
differences in the standard deviations of the examiners’ marks. In the published report, to 5 
Space and excessive technicality, the formule and their algebraic derivations were not reproduced 
in full ; _and both the inferences drawn and the practical recommendations made have since епс 
Some criticism. More recently, in reporting an investigation carried out for the Scottish СО 
for Research in Education, and after fully discussing * The Weighting of Measures,’ Dr. McCle Ty 
and his colleagues conclude by saying that, “in view of our results, we find it impossible Oole 
down a single principle, even of the most general character” (14, p. 83). And finally the W ices 
problem has cropped up afresh in connexion with the weighting of tests used in the fighting вае 
during the war and the weighting of examination marks and other assessments for the allocati 
of pupils to different types of secondary schools. 
| (b) Theoretical Importance. From a theoretical standpoint the question has f the 
interest. Without a clear appreciation of the effects of differential weighting, the nature O° 7^ 
various modes of factor analysis can hardly be understood. But in theoretical discussions ms 
factor analysis the factors envisaged are usually taken as defined, not by denotation (as somet ЛЕ 
соттоп to а definite and circumscribed curriculum of subjects or performances), but rai 
connotation (as a quality distinctive of a certain field or domain, of which the processes actus A 
tested or measured form merely specimens or representative samples). Nevertheless, t 
statistical standpoint, “a factor is still nothing but a weighted average of a set of measurements 
empirically obtained, being weighted according to the degree in which it demonstrably гер fusion 
the field in question "?; and, if this description were kept in mind, the common con * pro- 
between factors as statistical components and factors as concrete entities (such as ‘ abilities, often 
pensities,' and the like)—a confusion against which Professor Thomson has so rightly and 50 
protested—would less frequently occur. n the 
There is a further confusion which has not so frequently been noted, namely, Беркет апу 
different types of statistical component, regardless of any psychological interpretation. u field, 
factor which has positive saturations for all examples belonging to the same broad category, dure 
or genus is constantly described as a * general factor,’ regardless of the particular weighting M sch 
by which the so-called general factor has been extracted: in the field of cognitive proces factor 
instance, the * highest common factor’ in a bipolar analysis, the * basic factor’ in а gro. cegte 
analysis, the first “principal component’ in a Pearson analysis, and * Spearman's g ' are а 


‘ks 
E. C., The Mare, 
Rhodes was A 


a still wider 


1 Burt, C., * The Analysis of Examination Marks, ap. Hartog, P., and Rhodes, 
of Examiners, 1936, p. 303, cf. pp. 245f. As will be seen from the memoranda, Dr. d propos 
first highly doubtful, on quite general grounds, about the validity of factorial procedures, an Portiona 
instead to calculate the * ideal mark ' by “ taking a weighted average, the weights being ргор tions " 
to the degrees of precision, which are inversely proportional to the squares of the standard devia а 
(p. 193). Subsequently, however, he reached very much the same formule as myself (cf. P ob jed 
and in a later publication of great interest showed how factorial methods might usefully КӨП : 
in at least опе non-psychological field (Rhodes, E. C., * Тһе Construction of an Index О 
Activity,’ J. Roy. Stat. Soc., C, 1937, pp. 18f.). as he 
*Cf. Burt, C., Factors of the Mind, p. 74. Spearman, it is true, regularly objected D (cf 
interpreted the term, a factor was not, and could not be, either an average or а weighte' loc. cites 
Abilities of Man, pp. 61-5). I have, however, already attempted to meet his objections ced such 
рр. 211f.), and need not discuss them further here. I may add that, when I first introdu mind ® 
phrases as * highest common factor’ (3, 4) and suggested ways of calculating it, I ha s 
“factor ’ in an abstract mathematical sense. However, now that the term is used so fredi nore 
designate a concrete quality, it might be better (as I have suggested elsewhere) to adopt t} for the 
accurate term * component ’ for the factor before identification, and reserve the term * factor 

ability or other concrete quality with which that component is eventually identified. 
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as if they referred to precisely the same fundamental concept, even by those who would scrupulously 


rim from identifying that concept with * general intelligence." * 
no ор in this paper I shall treat the numerical constants that we are trying to determine as 
of them ане weighting coefficients in an equation of estimation, I do not mean to deny that some 
ments if they conceivably have a theoretical significance of their own. Just as the factor-measure- 
constants re саше appropriately selected and accurately calculated, would prove to be important 
Suitable tides lescribing a given individual, so, I believe, the factor-saturations obtained with a 
types of m [се of tests would often turn out to be important constants describing certain fundamental 
mark onl ental activity. In quantitative psychology, the enquiries v /hich lead to a factor analysis 
nly the initial stages of our work. Having by that means satisfactorily classified and defined 


the chi Bs D А 
hief modes of independent variation, we must next press on to determine, wherever we can, 
dations of a sound theory of 


th ; 

о парна equations which are required for laying the foun : r h { 

play a ew ONG IEE In such a theory what we now call factor-saturations will, I imagine, 

the lemen le, similar to that of specific heats and dielectric constants 1n physics or the valencies of 
LORS 5 in chemistry. This seems confirmed by several recent studies on mental work. But 

efer to it solely to remind the reader that I am far from regarding factor analysis and the 


calc 1 e r: 
ulation of weighting coefficients as the final goal of quantitative research. 
we shall be concerned merely 


Partial Regression Equations. In what follows, 
h are assumed from the outset 


wi Е 
ith the problem of combining measurements, whic assu 1 
what fallible) indications of some specified 


disposition; то ан 3 

for tactical property—some generalized characteristic W hich we үз to Ten: 

Such a ш Purposes, and which for convenience e be called a olor 

саду SE and what is its nature are questions We may suppose (0 ave 

act that wi ed. The school psychologist, for example, may have to accept t e 
ithin the schools concerned there is only one line of promotion, an 
2 hat the promotion of this pupil or of that will Ğepend on their marks 

m su ce а factor of * general educational ability ° * ; and his Ша 

$ Or tes be simply to decide what are the relative weights to E o 

bro, 4 the bes in English, Arithmetic, Reading, Spelling, Handwork, an Eee 
s f lest factorial studies the main object was ueni individual measurer e monly 


termy actor а; | i 
oon i iti ligence. 

by a d ў umed to underlie all forms of Cognition. senenge" col che V aod, um 

uin but by combining a wide 


3 single tes m d) 
n est (as Spearman and others originally assume, - ce arose 

their Whether (s Galton, Distt, E followers had maintained), us qua et “difficulty, 

Aer mie he dia tn abt be dilereni Se ае сае a oueit 

Presen d- eliability, or thei det end; M SO how the weigl 

Congi e i Um Yerkes, in his ° Point. Шы revision of the Binet series, contended ES он priori 

be мег lions (Ur maxima for the different test-problems, and put forward eae 

Daseq Ighteq T their determination? My own contention was that, in princip ا‎ coefficient 

on its; Ot equally, nor yet arbitrarily, but accor ing to an empirical reg" 
d ? mnn 


7 ascertai : 
i ained correlation with intelligence itse™- 
s used to point out between 


ena © Of ci "m х £x 

айо diferas доп is analogous to that which old-fashioned Iob пу? in the last of the four 

К and a‘ g -n * (a pro rium 15 а 5 { i et semper). 

пепео take the recognized о ce] "namely, id quod pertinet o mni et y is his 
wf oe scholastic instance if we define man as à * rational a factor, not the most 

+ actor’: ‘abili bec nesters а сотто! 

ang ji cou amenta], йн and really be justified ; 

а dat Ca 80 Often tempted to ask how far such 2 promoti e can at it enables Us to 

Cation mite x Que further advantage of the statistical treatment suggeste 

iey Pal abi Y to this question also. Thus, ир to the age 0 he individua ren 

У сев to account for nearly 50 рег of the the remaining a 

the “catio arious subjects of the curriculum ; E 

ay, aE general ү. abilities) БЕЛШЕ for only 10 per cent. or less 6 n "266; als 

Wertkes nP. 55, cae accounts for little over 25 Pet cent. : (cf. 6 P- і su 

of ш È: M » P.. 132, and refs.). ay 1915. FO 

"Ачар poked aly A Point Scale for Measuring Mental ees inet, Аза OB Te? oa, TOE 
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Sho tt, C a S 
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In support of this suggestion, I argued that “ if we adopt Professor Pearson's device of ° multiple 
correlation,’ it can easily be demonstrated that a far better assessment can be achieved with W 
representative sample of tests, covering a number of different cognitive processes, each appr орау 
weighted, than can be obtained with only a single test, or even a mere total of marks from cr 
tests taken as they stand." In seeking in this way to secure more trustworthy measures for t е 
* general factor ’ of intelligence, we are in effect taking samples or specimens of different types Fe 
intelligent performance, much as, in endeavouring to procure a trustworthy estimate of (ау) Н i 
proportional amount of water in human muscle, the physiologist examines specimens of all the ET 
types of muscular tissue. The only novelty is that we attempt to weight each specimen accor ing 
to its representative value. . 


. External and Internal Criteria. By what criterion, however, are we to determine 
the correlation of each test with the quality it is intended to measure? In the case 
of general intelligence, it was at first assumed that the independent estimates supplie 
by teachers or other observers would provide a valid standard. On that assumption 
the diagnostic value of each specimen test could be assessed by comparing or correla ia 
its results with this external criterion, and appropriate weights could then be corpus 
by the principle of least squares, in such a way that the resultant mark should Er 
the closest approximation to the estimates which the criterion supplied. Ошо 
ately, in many investigations no such external standard was available, and in о fos 
the independent estimates provided were not sufficiently trustworthy to be use lly 
that purpose. The method of factor analysis was accordingly developed, specifica у 
to meet cases of this kind. But essentially the same formula still seemed apple ee 
for calculating the weights, although the criterion was now based on internal rat 
than on external evidence. 


Substitutes for Regression Equations. The full procedure, howev 
plausible enough in theory, proved decidedly laborious in practice. ute 
lengthy calculations, first to determine the regressions or weights and then to conb 
the weighted sums. Accordingly, for practical purposes various simplified mer helt 
were commonly substituted, without any very full justification being offered for 
use. 


er, though 
It entaile 


А c, the 
(i) Unweighted Sum of Standardized Measurements. In the early days of factorial work sing 
most usual procedure was to reduce all the test-measurements to a comparable unit, either average 
Standard measure, or by converting the test-measurements into ranks, and then to sum rests, and 
the measurements so standardized. Tests which had a low correlation with all the оше. erimental 
therefore with the general factor, were generally rejected. Thus, in my 1909 article on Expo an 
Tests of General Intelligence’ (3), out of the whole series of twelve tests, I retained the bes efficient 
averaged the ranks for those six. This was found to raise the correlation from 0:76 (the © 
furnished by the best single test) to 0-85. ts were 
(ii) Saturations as Weights. In later work, where more accurate factor-measuremen not in- 
required (e.g., for the supplementary factors), the saturation coefficients themselves We ma 
frequently used as approximate weights. But sometimes a test with a positive saturation 
à negative regression (see Appendix, p. 124). two pro 
Gii) Simplified Weights. More often a rough and ready compromise between ШЕ. tos 
cedures was adopted. Instead of saturation coefficients, which entailed multiplicatio e bett 
or three-figure decimal fractions, simple unit-figure weights (e.g., 1, 2, 3) were adopte vill be fou f 


of problems in each test in such a way that the marks obtained for each test at once an. 
effects of a varying standard deviation, and gave the better tests a higher loading, without 
multiplication being required when computing the totals. ffects o 
Whatever method of weighting is adopted, it should be remembered that the real porte! (0 
Set of weights must be assessed іп terms of the relative contribution of each weighted COV" а]80 
the total variance ; and this will depend, not only (i) on the assigned or ostensible weights: f the © 
on the standard deviation of each component, and (iii) on its correlation with each © 
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Weighting i ^ 
i Non-Psychological Research. It is interesting to note that in 
Professor ee ah the relative * crop productivity * of different British counties 
have described a if ) has recently used two alternative devices, not unlike those 1 
а factor Sich is hey are, as he says, " based on psychological work, and measure 
Tegarding crop yi v reasonably be regarded as covariant with productivity b 
what I should. yields as analogous to test scores." His first method is to Еи 
the“ general Pin factor-measurements > for the * highest common factor,’ i.e., for 
Tegression E pA ае ^; and for this purpose he employs a partial 
is second tod derived in accordance with Pearson's method of * principal axes." 
àbove, namel hod is a simplified procedure similar to the first of those mentioned 
types of еру ranking the counties for their production in each of the main 
ultural produce, and then taking the average rank to indicate general 


Productivi 
: ity. s 
Y. He finds that " the closeness of the results, which are reached by 
cen the two sets of 


sidely diff 

e 4 Yos 

Те, Rood m thodi, is surprising ». the correlations betw 

S or four different years, all fall between 0:95 and 0:97 (inclusive). 
he same ranking 


ype of verage rank tends to give t 
p y She aN factor loading.” He remarks that he has “ been unable to 
SKS “ wh ical proof that a close correspondence will always exist ” ; and 

in psychology ° (13, рр. 25, 45). 


ether a simi 
er a similar problem has been studied i 


» SO far 
One t as these examples go, " the a 


п. 
RENT TYPES OF 


T 

HE EFFECTS OF DIFFE 

ing SPecial Prob WEIGHTING 

St ош о In examining the general effects of different modes of weight- 

Je 1n four stg involved can, I fancy, be most usefully studied by considering the 
qual weightine » namely, (a) a priori or subjective weighting, (b) random weighting, 

; r ad hoc weighting (1.©.› weighting based on 

rch). 


“Mpitical q 
of the particular resea 
1 


the O A Priori 
Om пог б a 
highest rods o wer eighting. In the early days of mental and scholas 
3 ighting proposed were usually based on subjective considerations [he 
bject which were deemed, on a priort 


Stounge eight 
nds. S We; E. 
the Most 0 be of e dena ees to those aspects of the su 
aminer, "liable, S est moment, or to those m 
is ordina? Or insta uch principles are still often follow t ‹ on 
; draws to the sup nee, frequently allot different maxim nt question-pap: 
See between pposed importance of the subjects ; and, азагы, 
С E needed weighting for value’ and ` weighting for validity. 
Ven. oot by sever: pos the criterion appears to be composite in. 
д Pensatory components, which may be independent, alternative, OF 
Y (as in occupational success). 
lop; ute, Scussi ШҮ: э tal 
'Obics тете is ieee of the need for a systematic determination of weights in attempts ai xe. 
«үү Ass, A Star ound in Edgeworth's papers on * The Statistics of Examinations’ айй Galton 
кше م‎ "(aeg cl» 1899, pp. пх PES: LIII, 1890, pp. 460- 6 m ; 
пзе t » 1889, pp. 47 t 
че weld i ра term р. 471f. < reliability on the ground that diminished 
салоні ® Proposed sc. diminished validity. By ' weighting for value" nean cr t ) 
ipneinatio alidity or аге based on judgments of practica or cultural wo county scholarship 
г than’ ог reliability. Thus, at a meeting of examiner? for ie jus pape 
imum allotted to. the | h-speaking COT 
lized English Fhe validity, of 


unit, han portal 
San it that a loe embers urged that the maxi 1 
38 to ADer e im ed to the Arithmetic paper, " because 13 a d 
Faig how «5 ап dnd ean for its members to write English than to do sur teresting suggestion 
por м Weighti ex of innate general ability was not discussed. For “Kelley. T. Lo Essential 
ental Life® for social importance " might be carried out, see т , 
5 €, 1935, pp. 70, 92-5 : especially equations (iii) and an: 
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In this respect the mode of weighting was not unlike that employed in calculating Compos 
“index numbers ’ 1 by early economists ; and thus open to very similar criticisms (cf. 7, p. 212, fora 
remark which applies admirably to the psychologist’s computations). A priori weighting is Occasions 
ally used in other scientific fields. But, according to those most competent to give a judgment 2s 
the matter, “ ‘empirical’ weighting, that is, weighting according to individual judgment (or, Е 
the psychologist would prefer to term it, ‘subjective ° or * impressionistic” weighting), “is alway a 
matter of difficulty, and at best its results must remain open to objection." * Accordingly, I ventu 
to suggest that even here a factorial procedure might prove suggestive. 


Even where those concerned are unanimously agreed that trait x (say) has a higher social e 
cultural value than trait Y, and therefore deserves a weight which is л times as large as that аз ога 
to y ( > 1), there seems to be no generally recognized principle for deciding what numerical Yi 
shall be assigned to n. Accordingly, I suggest what may be called the principle of virtual frequen a 
To allot to x a weight which is л times that allotted to y, means that we are in effect treating сас the 
the x-measurements as if they had been based on » times as many independent observations as 
y-measurements. 


I may add, however, that, alike in psychological testing and in school or university examinator 
‚ Wherever I have been able to check the results of such subjective weighting against objective foy Sir 
its value has proved extremely doubtful. I should thus readily endorse the view expressed ers 
Percy Nunn. Ina memorandum on * Methods of Marking ’ (dealing primarily with the TE е 
Certificate Examination), he has stated that, “ with rare exceptions, no satisfactory case can oe unds. 
for allotting different weights to different examination papers merely on personal or a priori ec ts on 
The wide differences between examiners as to what the weights should be, and the erratic ef senting. 
mark-lists, suffice to demonstrate that in most cases subjective weighting is just random УЕ 
What the precise effect of such random weighting may be it is scarcely possible to predic ce, the 
one thing can be said with certainty : its general tendency must be to reduce, seldom to enhance, кү 
accuracy of the final marks." 3 . 


at, when 


(b) Random Weighing. As I have already noted, it is too often forgotten th unt 


a set of measurements or marks are summed or averaged just as they stand, a certain ашо 
of weighting is unintentionally, and often quite unconsciously, introduced, um rences 
fact that the raw marks or measurements have different standard'deviations. Such diffe or to 
in the scatter are likely to have no relation either to the value of the separate uh the 
any common factor underlying the whole set of tests or to any valid criterion Y random, 
tests or marks are intended to estimate. Thus their effects are equivalent to that o СО oftel 
weighting. Since, then, both deliberate weighting and unintentional weighting ined 
tend to introduce a large element of random weighting, it seems desirable to detere "i 
we possibly can, what (in Nunn's words) “ the precise effect of such random weigh 


likely to be. t hap- 

Let us, therefore, begin by examining the effect of weights, selected absolutely or ents, 
hazard, on the weighted sums so obtained. Let us suppose that we have л sets of aS руг 
Xr Xo +++ Хи, each set measuring the same attribute, x, with different degrees О ibute. “ae 
or, it may be, measuring some different but typical manifestation of the same EET s as 
the reader wishes to keep a concrete instance in mind, he may think of the mE ai 
the result of applying л tests of intelligence to N school children.) Our ultim ropriate 
to combine these 7 measures into a single composite measure by means of some apP and for 


linear function. Let the set of weights for one such linear function be designated Vi, 4 


to this and other problems in economics (10, p. 313). Bowley’s definition of index numbers D is по 
ira f ior The chief difference (which curiously mmonly ty 
brought out explicitly by his definition) arises from the fact that index numbers are co! rbitra! 


"in : " na 
to measure variations which are related to time, and hence are expressed as percentages O like factor 


vely (ty the, 


М 
ren 
sa] Meas, 
* Freund, L, The Study of Chemical Composition, 1904, p. 85. Cf. Kohlrausch, Physical Measure 
ments, 1894, especially chapter on ‘ Errors of Observation,’ and Ostwald, Рлухісо-Сћеті 
ments, 1902, chap. I, * Calculation.’ 


‘ona 
зш з the Internat! 
This opinion was repeated in a later memorandum submitted by Sir Percy Nunn to 
Institute Examinations enquiry. 
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another И, (i = 1 
= 1,...п). Th B 
to compare may be sek en the two alternative composite measures which we propose 


y = Uy, + Usxs +... + Unxn, (i) 
И кене z= Шух, + Were +... + Warn. (ii) 
e that x;, y, and z are each to be in unitary standard measure 


Now let r deno VI = s ations, an е means. 
Es Ü note the average of the n(n 1) 1 i 
а 2 1 ^ inter-correlati U and W th: 
w? the variances of the weights. It is then easy ELS (see Appendix) tha | 
ind oy? and с е that, 


if the wei 

ights have bei 
m ^ en chos Я 
easures will be aue. at random, the correlation between the two composite 


-— tz r) си? w? 
hz = 1 "E 1c: + x) i (iii) 
21-05008, (i) 


nr 


an ? 53 
мезе two oe deviations of the weights are the same. 
i sions indicate very clearly what will be the general effect of random 


Dg in y; 
arious a 
c 5 z 

Mhe larger the е l cases. The following conclusions тау drawn. 
ch ing п between the OE tests employed (i.e., the higher the value of n), the higher will be the 

Osen absol lation, Le tr eighted sums. As the number of tests is increased indefinitely, so the 
) Q) T utely at random s to become perfect, cven though the two sets of weights have been 
P) the p; е la s 3 

2 the hi arger the a А E 
rr verage inter-correlation between the several tests (i.e. the higher the value of 
1 сел » SO the ` е s the average inter-correlation 
OWer will Chosen absolnte Iting correlation, ry:, tends to become perfect, even though the weights 
Ctor fae е value are y at random. Conversely, the lower the average jnter-correlations, the 

Correlati rz. Ап important corollary follows. If, in order to be sure of assessing 
е апо ighly heterogeneous set of tests, which have low 
n i number of tests in our 


Ty, the; th 


When the meats 


Th 
Кеп the the sta 
1e twi the standard deviati 5 
oth E Weight, o WCighte eviations of the weights, : 
eet) ЫШ are one ‘Sums. If in both sets the Standard deviation is Zero (i.e., 
the tions, th = correlation A though the weights in one Sẹ i 
preti Xividu, € lower will b will be unity. Conversely, with rand а 
ly clog Weights d be the correlation. This confirms the familiar observation t at, provide 
o not vary widely, the agreement 


Teq Se 
uceq, ^ Whereas whe between the t 
n the range of the weights Va greemen 


i ries widely, the à 

of the Sain t both sets t 

е he agr he mean weight is large in com arison with the standard deviation, then 

erences bene ent between the two ETE pus will be close. Jt is thus the relative size 
een the weights, not their ‘absolute size, that affects the resulting болш 

s to be ositive, both the mean weights 

i weights or all the 


Positi for t : 
hag dive wes Same = correlation between the weighted sum p me 
“һе ресі ts are РЕВИИ if in each set the vast mann 0 x e peen 
i e : on ; 
е throughout, then the resulting MET d i 1 oe al proce dires 
jab prod 


s for the 
inci ent,’ it is 
first * principal componi m the battery 
Ci 


Psyc CF it is arn 
(with osical a à Senes" Calculation of measurement 
[3 "nquiri: |, actor, a Я ic’ fa ога Ў 
teal um POSSÍble excep likely to а зо ie saturations for all the tests OT trails Jh ence, unless 
ting mec! tests ception of one ог two variables which may act as ' suppressors ^. ЕТ Jair the 
(c urements mits is small, gross inaccuracies in t weighting coefficients will Imp 
Tr tiver qual Wer] far less than might be supposed- 
Ato е hti "eos 
iehting. Let us now turn to the Case in which the Stanica grounds. 


or, a qual w 
ргас Wei С tur | 
те Ehts. This is of special interest on both practical s for combining à set 


Cor Asure ical : 
"bin, ents [ns ndpoint it provides by far the simplest proce u^, 
th Let 9 implicit, one composite measurement ; fro a theoretic ). 
" Same 5 COnsige y adopted in the analysis of variance (see Ma dministered twice to 
rfi a 
Oup rst the common situation in which the 7 tests Mod above were applied first 
jx tes 


53 Or example, іп the 1909 enquiry the 5P* te 
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by myself and secondly by Dr. Flugel. A straightforward deduction from Pearson's product- 
moment formula yields the following result :* 

t "s ZG + xs +... + х4) Ga’ + xa +... +30) T 
E VS Capi ah ut VISUS oe cer ae Oe 

drat + XXra ra + (п — ra v 
n+ ELY ` 1+ (A Drs | 

where the dash denotes the second application of the test, and ri the reliability coefficients (it B 
assumed that ry and rry have been pooled to determine what is here called rj; i # j). Inthe examp! 
cited the formula yields rzz = TT 0423 = 0:898. On calculating the correlation direct 
from the averaged ranks, the value reached was 0:902. (The slight difference is due to counting) 

If we know the correlation of each test with the criterion (c, say), then the correlation of the 
unweighted sum with the criterion will be 


те = JESN (LJ ET s.n 


(1 = 3). Wi 


By solving for л іп one or other of these equations it is possible to estimate roughly how many ШО 
will be required to yield a correlation of a required amount.? If thecriterion isan internal шеп 
based on simple summation, the expression thus reached simplifies to the square of the expres RC 
reached in equation (v), with raw now representing communalities rather than reliabilities. d by 
reader can easily deduce for himself the corresponding formulz for an internal criterion derive 
weighted summation or any other procedure. ob 
^ Eu same argument shows that the correlation of each test (say, the kth) with the average 
all the tests is 


77 d ب‎ vii 
te = VY (hj = 1,...п), ii) 
where гу, the self-correlation, is here taken to be 1-00. Now, with the majority of co 
obtained in psychological testing, an application of this formula would at once reveal t 
employed have a widely different correlation with the * general factor,’ estimated provisiona 
unweighted average of all the tests. And this naturally suggests that it is a mistake to gl 
tests precisely the same weight. Hence we are led to enquire how the best set of differentia 
can be procured. ost 
(d) The Best Weights. If we ignore the possible existence of specific factors, the med 
obvious method would be to seek the * line of closest fit. This was the procedure prop 
by Karl Pearson, who defined ‘ closest’ in terms of least squares. He further pointe cipal 
that the requisite weights could be conveniently obtained by seeking the “ largest EA 
axis of the correlation type ellipsoid.” And it is, as Kendall has observed, “ a теп ace 
fact that all the information we need, even to represent the variation in a linear Su 
of m dimensions, is contained in the correlation matrix." * 


rrelation tables 
1 that the tests 
nally as the 
ve all the 
1 weights 


arl 
1 The principles from which this and kindred formule were derived were first developed by Кап 
Pearson. To psychologists equation (v) is best known from its application by Brown an ,, and 
to the problem of increasing the reliability of tests by increasing their length (cf. Brown, out that 
Thomson, G., Essentials of Mental Measurement, 1925, p. 132 and refs.). Brown points source 
his formule were “ based on demonstrations given in lectures by Professor Pearson (the 5 
from which many of my own were also derived). 


* See 10, p. 303, where several numerical illustrations are given. It should be emphasize тера! 
the assumptions involved are never likely to be precisely fulfilled, the indications must be 


only as approximate. 
? Pearson, K., * On Lines and Planes of Closest Fit,’ Phil. Mag., II, 7th ser., 1901, рр. 559f. cludes 


t, 25 
d thats d 


*Cf.17, 89. I should myself prefer to say “in the table of covariances and variances." This! an be 


n бюд А y p c 
the correlation matrix with unity in the diagonal as a special case. The method, however» 


sums) + 
applied to the unstandardized table of variances and covariances (or square-sums and pro uct: sehen 
and Kelley and others have supposed that the components so obtained would be the same the case 
obtained from the standardized table of correlations : but it is easy to show that this is not 
For certain purposes an analysis of the unstandardized covariance matrix seems preferable. 
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It is instructive to note that the same solution is reached with formulations of the problem that 
at first sight seem quite different. For example, we may seek (i) weights that minimize the squared 


discrepancies between the observed rest-measurements for the individuals and the fit based on the 


hypothetical factor-measurements, (ii) weights that minimize the squared discrepancies between the 
Observed correlations and the fit based on the hypothetical correlations, (iii) weights that maximize 
the variations between the individuals, i.e., the general factor variance, or (iv) weights that maximize 
the square-sum of the multiple correlations between the observed factor-measurements and the 
hypothetical factor-measurements : all lead to Pearson's method of * principal components." 

Kendall has stated that “ this approach is the first which the statistician would consider, and he 
[the Statistician] may be a little surprised to find that the psychologists do not give it pride of place. 

ut in the early days of factorial work—say, from 1909 to 1924—all those who attempted to factorize 
complete correlation tables would, I think, have given Pearson’s method, if not pride of place, at 
least priority of place. The reasons for the ‘modifications I have explained elsewhere (16). 

The chief objection to the method was, of course, the difficulty of carrying out the calculations å 
and this, as we have seen, led to the substitution of simpler schemes of weighting and in particular 
to the use of equal weighting. But, if we may judge from recent discussions of factor analysis by 
statisticians, the change which has aroused the most widespread criticism is the use of reduced self- 


Correlati , ion ! hav ‘ven elsewhere, and may be briefly 
elations. My reasons for making the reduction spe EA to explain the observed 


Summarized as f | | ; ^ unl 
correlated as follows, With self-correlations of Wy Wè ent the specific Factors by 
bviously à by more factors than the data warrant, and (i) (0 represe As the 
fee t КОД! bipolars. (iii) Further, Pearson's procedure, which seeks in psycho- 
Ogical wor nethod of least squares, places all the test-results on the sa Sier widely in their 
Precision the Measurements supplied by tests of different kinds are apt to differ P 
rst investis. their validity. For example, among the twelve intelligence tes 0-8 
the роот. ОП (3), the best had correlations with the criterion amounting to 0 
Calculated ad correlations of only 0:20 or less. But it is surely uo UY id 
ely as thalmates of intelligence should fit the measurements furnishg И S 
e еу fit the measurements furnished by the best. AX? allowance des 
Self-cgrrepy What I termed * correction for specificity’; and this in turn ental 


Tela 
ations for self-correlations of unity : (see 16, Р. 111 and refs.). 
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15 onwards this substitution became almost Moce ure more recent Ph 
ma SEI шош complete tables of correlations en Pt his brief reference nalities ” 
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first factor ; and for later factors the loss of accuracy tends to be cumulative. For 
rough pilot enquiries, no doubt, the method may often be suggestive. МЕШ 

In such cases, however, the question is naturally raised : are we really justi E 
in regarding these rough and ready estimates as representing ‘factors’? Ѕреагт e 
and those who worked with him insisted that they could not be treated as rd 1 
at all ; and this appears to be the general view. But I believe it will not be diffic 
to prove that such estimations have all the fundamental properties of a factor. 


, 
‘ 

It would, I fancy, now be almost universally admitted that what is commonly called a pos , 
is essentially a ‘weighted sum’; yet it is not so widely recognized that any ‘ weighte terms 
constitutes a ‘ factor To demonstrate this point, however, it will be necessary to define о орва 
a little more precisely. As Professor Kendall has pointed out, factor analysis has been deve oha 
mainly by men who were not primarily statisticians ” ; 3 and in consequence it has “ tended мр: thé 
a language of its own." But, although there may be some doubt over minor points, mos to the 
terms employed by psychological factorists can be translated without loss of meaning 1n 
terminology of matrix algebra. s as an 

If, then, we agree to regard a table of measurements for n traits obtained from N pemon ctions 
n X N matrix, M, say (where n < N), we can regard the figures in each row of М, as linear oP Bees * 
of r linearly independent 1 x № row-matrices or * vectors,’ chosen to serve as a convenient r, these 
for all such measurements, where r will in general stand for the rank of M, (r € n). Morcover, 
selected vectors may or may not be statistically independent. Thus we may write 


Mo = FP, (viii) Lm 
where Fand P denote what (in the usual psychological context) would be called the * factor-saturatio 


1 : ates 
for the traits and the ‘ factor-measurements’ for the persons respectively. The equation 5 i 


that the measurement for the jth person in the ith trait can be expressed as a weighted sum, VIZ- 


"nig = fa pu + Лары +... + уру = Уры, ш 


where fix are the * weights * assigned to the kth factor when reconstructing the ith test. By «оа 

the equations for the p's (ог adopting some equivalent procedure) we may also succeed in exp 

the p's as linear functions or weighted sums of the m's. ote the 
Now if we use fy (with a single subscript) to designate the kth column in F, and px to den о 

Ath row in P, then we can write fip; = Mz, say, where МЕ will be an л x N rectangular v factorial 

“rank one,’ that is, all its rows will be Proportional ; in other words, M; will form what an sed into 

Jargon is called an asymmetrical * hierarchy.’ It follows from (viii) that М, can be analy: 

r additive sub-tables of this kind, so that 


Mo = Mit M, + o. EM; УМ. (9 


If equation (x) is to be fulfilled exactly, the number (r) of unit-rank sub-tables in this SC 
be identical with the * rank ' of M. By the rank of a matrix is meant the number of linearly 
pendent row-vectors (or column-vectors *) contained in that matrix; and, if an exac 


ЭЛЕ т 
ries wil 
inde- 
torial 


s< with those 
1 I endeavoured to illustrate this conclusion by comparing the results of factor analysis with cally 
of analysis of variance, since with analysis of variance weights are in effect treated as nU e same 
equal (cf. 15, section on * analysis by simple averaging,’ and Table XII, last two lines). sing factor 
point emerges if we turn to the illustrative example provided by Professor Kendall. On изи 
analysis to extract supplementary factors, we find that these clearly classify his crops s 
roups—cereal, leguminous, roots, and Brass. But if we use unweighted figures and try | greatly 
Гое сиза оп by working with simple ranks and rank-differences, the grouping 1 
ured. 
* Cf. 17,81. Iwould also add ** and by writers compelled to use language that would нер. ith 
intelligible to psychologists, teachers, and research-workers, who were not themselves fam! f what 
Statistical terminology.” Thus a non-mathematical psychologist seems to get some ink ing wildered 
i ommon factor,’ whereas he would i И 


he understands how to calculate а * weighted sum,’ but, w i оша in Ule ear 
> , меге he an editor, he would i c lin 
days have refused (as Ward once did) to publish a paper which НООР "S Stich technicalities 2 
5 
ә ion it follow: 
? The number will be the same whether we Select rows or columns. From this definition itf This 
that the rank will give the order of the largest minor determinant in M, which is not 07, P- 2 


F ў 515, 
Aitken, A. C., Determinants and Matrices, 1939, p. 60). But for purposes of factor analy rice 


definition given in the text is more convenient : itist Vectors апа 
1943, pp. 22, 64. hat adopted by Macduffee, 
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reconstruction is desired, this will be identical with the total number of ‘ factors ’ precisely obtainable 


from that matrix. 
я Linearly Independent Factors. Тһе most general way of effecting such an analysis is 
o take for the first hierarchical matrix an expression of the type 


1 5 
М, = HT (xi) 
wher = $ 
е зр = wy! My, s = Mow, , and v = wg Мын. Here sp will be a row-vector ud 
ch column of Mo; St W 


elements l 

are derived b ighti i in ea 

a co y weighting and adding the figures in € c 

lumn-vector whose elements are derived by weighting and adding the figures in each Tow 


of M G 

ee context i d s, a column of sums 
xt s» will be a row of sums for persons, anc тет. 

AS 1 djustable divisor introduced 


Or tests) ; i : f 
to ban) 5 у, a scalar (analogous to a variance), will be simply an à ble 2 
may bee the products s,s, to terms of the same scale as My. So far, it 1S to be nee ш 
divisor uy row of weights, and w, may be any column of weights, provided, of course, 
15 not thereby reduced to zero. Mw 

rik an application of the usual criterion for rank will quickly show * that E 1 

€ initial matrix M,, the matrix of first residuals, that is, Mo — Ma = ony WE опей A 
Precisely t ules is one degree lower. Moreover, these residuals in шен ҮШ pie sehen 

ame way. Once again, any set of weights may be Pene: СОКЕ Бү кА 


аге not th 
tractio € Same as the first set ; and once again the rank will be reduc \ will 
be 100 and factorization r times (r being the rank of Mo), Whereupon the last set of residuals 


und to vanish ; 
vanish identi i 
entically. divisor may be considered 


At th А ^ 
as applica Same time we may not every stage, the standardizing d1» 
er А рт to be pue Se " x Tete AE session Mow: (and their subsequent апи choice, 
» may be een the two (by taking the square root o Jhus, асса if desired) 
US the woeatded as specifying factor-measurements for the person? (standai eig i ue 
dy s. Conse- 
quently, ps nts for the traits (standardized if desired), and Sp с for the adc ЕРШЕ 
RES resolution I have described may be effected witha yy ULT ea 2а aa oe ane aim at the 
equate th to analysing and classifying the traits. t pn Meri fae instance, welds that are 
азе of yore Another iehting based on à single test (say 1,9 ^^ 
of (say-1, Leal), ora weighting ed OP ect an ana 


Statist; of fact 
if We се cally Independent Factors. A more narrowly defined x Re 


Statistica get Our weights ments wi 
N s ctor-measure 
taking w, У > independent de. а), This сап be effected БУ fet of freely chosen 
“Ights, W f 2 Mo’ wp , and thus avoiding the introduction O a seri) now becomes 


e then have y = Wp’ Mo Sp = Wp Mo Mo Wp- Equa E 
Ro Wp - We Mo R (xii) 


wp. wp M. 
Wher, um ; Seg m Mo AM o Wp up c d to be in 
Mem D ed is assumed to 
ч d : s usual) is а 
po Stan Notes the matrix of observed correlations and M, (8 


ard meas 
Ost 5 ure. have 
Andardize з, we must divide it by the rootSquare sun Wehen 

wy Mo- — wy’ Mo (58У) 5 

Е —— 1 
and we ру = WE Sp = Уну Ro We a) 
ron, hen take My My н» 2 Mo pi’? 
0 hat 

s 


(xiii) 


А fr = vist = Уру Кон» ctor-measure- 
(n) "lum of correlations ewe попатна С за Tig oat 
qoe irre subse Му A AP тада advantage sy rir лаш 

M, matrix оао the factor saturation. fa пе farger initial 7 X N matri; Үс чык 
tave Сал factorize t| Р K А 2м, = М in precisely ne me in be any set of 
1 ° second dp boe Ps 23 Mae VORA up Ads 

i Lagrange (cf: вос 


Thi 

log 18 Stat to 

* cit, eme, € jmately due 

"up. pp. {iid the general method of redor eres ш atrices, 193% PP 
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i ion 
weights. And for the second set of factor-saturations we have fa = Ко Wa. The correlati 
between the two factor-measurements will now be 


Papi Wa’ (Mo — fip) pi! we’ (Mo pi! — f) wa’ (fı — fi) 0. Gy) isof 

This result shows that, with any two sets of weights, w, and Wa, We shall obtain моа Ка 
uncorrelated factor-measurements. (The only restriction is that the divisors Wı Row, an MED 
shall not be zero ; ana this evidently prevents us from taking the same set of weights for ne a Sing the 

We can continue the process to analyse the successive sets of residuals, until, after de "3 ЫЙАН) 
rth hierarchical sub-table, we shall be left with residuals of zero. It follows that we may (i Mes As 
choose completely random weights for Wi, Wo, ... Wr, Or (if we prefer) unit-weigh ts a; 
11, +1, ... +1] (in which case we have to alter the sign-pattern to prevent w, = ws = determi 
or again [1, 0,0, . . . 0J, [0, 1,0, . . . 0), 0,0, 1, ... 0]. and so оп. We can in this way 
the rank of M, , and resolve M, into an additive series of r hierarchical sub-tables. f M, will be 

In practice, of course, with an empirical set of n x N test-measurements, the rank o. АУ © 
п or N, whichever is smaller; and our interest will be, not to ascertain the complete nu nificant 
factors, but the minimum number of significant factors, that is, what may be called the e entary. 
tank’ of M,. Into the further limitations imposed on w;, We, etc., by this or other supple 


8 و‎ z o ired are 
conditions it is unnecessary to enter here, since the more specialized procedures then require 
already sufficiently well known. 


IV. EXPERIMENTAL STUDIES OF DIFFERENTIAL 
WEIGHTING 


An Attempt at Empirical Verification. To illustrate the effects of the di 
types of weighting we have discussed, it may be helpful to glance at a few uu 
cases. The object of the following experiments was to examine instances where. (the 
happened to be an objective check on the calculated * factor-measurements s the 
Weighted sums). In this way it is not difficult to show that, at any rate so ar e 
dominant factor is concerned, (i) equal weighting will commonly provide mo e ds 
good estimates, (ii) much better estimates can be obtained by a factor ni 
and that (iii) weighted summation with reduced self-correlations will yield m 0 
estimates than any other procedure—better even than the classical met onal) 
calculating the line of closest fit (i.e., weighted summation with unity in the diag set 

An Example with an External Criterion. Five children, aged 12 to 15, were E from 
of 80 boxes of the same size and Shape, but different weight. The weights Йа, ht of 
150 to 250 grains (about 10 to 16 grams). Each child was asked to assess the derive 
the boxes ; and the investigator’s problem was to ascertain how closely the estimates to 
by averaging or otherwise combining the children’s judgments would approximate values. 
true values, when the mode of combination was determined without reference to those 


TABLE 1. INTER-CORRELATIONS BETWEEN JUDGMENTS OF WEIGHTS 
Persons BER. F.B. A.M. J. C. H. W.R. 
ERR ae (931) 636 531 407 238 
йс; oe -636 (948) 575 354 P 
A.M. > -531 575 ' (872) 178 13 
CN: о. 407 354 178 (706) 053 
W.R. Ж: 238 152 131 053 (553) 


1 The last two types of unit weight 
implicitly adopted in analysis of variance : 
the old triangular reduction discussed by 
preliminary reduction to simplify subseque 


eightin€ 
o 
form ©, 


dependent on later : (see Burt, 10, p. 307 and refs.). 
indirectly to Karl Pearson (cf. Biometrika, XV, p. 135, 
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As with Galton’s test-weights, the successive increments were geometrical}; and the set was 
so Selected that, judged by their logarithmic values, the frequency-distribution of the weights would 
SbproxImate to a normal curve. They were presented in a randomized order. Before lifting the 
Rist, each subject was given three standard boxes which he was told weighed 100, 200, and 300 grains. 

о allow for Weber’s law both true values and assessments were converted to terms of a logarithmic 
scale, and were then reduced to standard measure.* 

Owing partly to the comparatively small differences in the weights and partly to differences in 
care or discriminative capacity among the individual children, the agreement between their judgments 
is by no means close. This is shown by the inter-correlations,? which are set out in Table I, and give 
what may be called the * inter-personal reliability ’ of the assessments. With 80 items the standard 
error of a zero correlation is approximately + 0:112, and any coefficient over 0:217 may be accepted 
th Significant. To obtain a measure of personal error or * intra-personal reliability * for each child, 

n experiment was repeated with each subject after an interval of a week; and a correlation 
res culated between his judgments as given at two successive trials—a procedure originally suggested 
кА Wissler.! To check the weighting equations derived from the first experiment the same 

ildren repeated the test with a second set of boxes ranging from 200 to 300 grains. 

hi Agreement of Individual Assessments with the Criterion. (a) External Criterion. Each 
t ild’s assessments were first correlated with the * true weights.’ The coefficients are shown 
in the first column of Table IV. It will be seen that the correlations range from 0-196 (which 
is scarcely significant) to 0-841. Two out of the five children not only have reliabilities well 
пм 0-90, but also appear able to arrange the boxes very satisfactorily in order of weight. 
alo m quite a good estimate would be obtained by accepting the judgments given by L. T. R. 

It would, h icip: te estimation, if we relied, not 

и ‚ howe 2 anticipate а more accurate estimation, > 
n e fl Понг pem Каи lower god, Ви on ie conte s ae 
Indce ‚1 ee it), which can be obtained by eat зар е available for averaging, 
the Bteater is шешу supposed that the larger the number 0 f all therefore, let us try averaging 
г Bments Specs! of the average so obtained. First of all, Кү average for cach of the 
BOXES, We obtas tained from all the judges. On calculating а | rhe same coefficient is reached 
an Use equatio à correlation with the true values of оп ined with the best of the five judges by 
m insigni СА nde (vi) above.) This figure exceeds that o tat vhich would be obtained by averaging 
assess ant fraction only ; and is not nearly as high as that v veraging all the assessments 
dise Ments of th ; я It is clear that, by @ 

criminatel e best two judges (0:902). he poorer subjects to count 
not kq XDeCte due have allowed the inaccuracies di де However, in investigati 

NOW the trey pooling the judgments of the best Шу were the best judges. 
Our ор, the E Ye lues, Weiconld поа ШЕЕ TA v {сте thei nal indication. 

т only % 26n, in the absence ternal check, can We бетек. ovisional indication, 
and th Y Means of doing so is gayem joint report of all the judges as û pr 


measure th i 
е a his agreement W H rage О! 
+ accuracy of each one by | ag pstituting а weighted ave : Stable set of 


1 
(sj 
i f. Galton INE 
hee. soo Inquiries into Human Faculty, 
Sc. Xperi; 
ale (cf. B 


re 248-51. a 
1883, Appendix, PP- TE e original Binet-Simon 


ime i «nts? introduced у n Persons, 
nt was suggested by the test of ‘five weights g10 and * Correlations "Gents, both at 


Brit urt а 
ТИ. J, >» C., Ment ic Tests, 1921, P- ,8 m e 
Шер о ^; ХХҮШ a E i na a practical labo O ally with ое. o ү 
Ta d Lon | Быш :t as a stock ex! 4 he di 
тосе е Various оик ану ne it as articular, the analogies and үе to rely only o 
Бетта «tuted (a) Син Г, п external criterion and ( ) mue. We lysis to correla 
Ч crion. Inci it exemplifies the applica 
à TS ncidentally it exempt for 
m D eher ^ as well as between tests or traits- F. Chalmers whom 1 nave, tO, ho whom 
i i 1 " а 2 m » 
(a Ed to mee ons bim poi oth by ork ha ‘bee! checke b i Rein but the correla: 
оп tabt. Tuch ind gures ; and most of the v eo "hildren took the york 
УҢ le indebted. I should add that more (19 sition а a 
our ome rte ere gar o cgi Lad p aeui Mon Supp + 106 the tests, and 
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i e 
(b) Internal Criterion. Let us now imagine that we present a list of figures аа 
five sets of standardized assessments to an investigator who knows nothing w за ds apis 
their actual nature or source (he might, for example, suppose they were marks for e ERIS 
in five subjects of the elementary curriculum) ; and let us ask him to determine v e 
for as many independent factors as will reasonably account for the variation in the MEE five 
It is evident that he could not in any case determine measurements for more Р ББП 
common factors. If he follows Pearson's procedure, he will take the variances p. E. 
correlations to be unity, calculate his factor-saturations by weighted оптат оа Er the 
obtain weighting equations for the maximum number of components, namely, cn be 
matrix of regressions or weights for determining the measurements required wi Mec 
simply the inverse of the factor-matrix. The factor-saturations thus obtained are s ee 
Table II A. It will.be seen that the first component accounts for about 49 per cent. 


cent. 
total variance, and that the other components account for 19, 17, 8, and 7 per 
respectively. 


TABLE IL. SATURATION COEFFICIENTS 


Persons A. For Observed Table B. For Smoothed Table 
V 
I п TI IV у 1 II ш Di 398 
LTR. | 861 -005 -018 —-307 —-410| -865 —-062 —-138 Е Ly 
Е.В...  .. | -849 —.096 —-156 —255  428| 835  — 070 —-156 es 1012 
A.M...  .. | -754 —090 —-481 ‘440 —066| 747  —.097 —-304 106 011 
J. C. H. “+ | 336 —354 717 221  -036| уу —201 -803 09 .007 

WARE 321. 907 .252 073 -062| -316 :940 118  " 

o 321 

Medeae dev 19x40! , 537.84 406 3601 405 942 794 "538 


| 
TABLE Ш. CHI-SQUARED ANALYSIS 1 


IEEE. 
QNT. Кау у О w (i) | еш G9 O 
Latent y: Border. Xt VD C T : 
А 5 - d Je nt 
MESSA таъа | cays lgR) D. “tine | (Dime) i 
5 
0 | 244 | 2870 —12483 93-0 10 183 | 668 4 و‎ 
1 957 77037. —03514 262 6 126 102; 3 60 A. 
2 834 | .8072 —02142 160 а FR 158 2  $g 
3 -406 "9974 —0-0026 0:2 1 3-8 02 = 
4 :360 | 10000  —0.0000 00 0 = اڪ‎ - 
Total | 5-000 = зе ES DC MEE 93:0. 30. چ‎ 


Write 

* Note to Table III. Yn col. (i), & denotes the number of factors that have been extracted. 

4 = p — k for the number of roots Still to be extracted. Then in col. (iii), i 

S est Mesa... Ap Geom. mean of last 4 roots . 

= — ote 

Geass Axes +... Fg qth power of Arith. mean of last q roots 

col. (v) gives chi squared for the residual T ver Wh] 

the multiplier = N — p — } = 80 — 5 — 3 = 745, where N = the number of items responding 

the correlation extends, and p the number of persons correlated ; col. (vi) gives the B Jevel 5 

degrees of freedom, and col. (vii) the borderline values of chi squared at the 5 per fences from 

e factors (obtained by taking successive diffe: 


Ry 


> 1 stati 
> wn, partly to indicate certain analogies with OU OR ate 
tests (e.g., for homogeneity of variance). Note that, when the side correlations are арр at 
hierarchical, | A | can be calculated by th 


vector of sa turations, and U? the diagonal 
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k Now with a * principal axes ° analysis, based on the total variance (unity in the diagonal 
of the correlation), we can apply the chi-squared test in the form suggested by Bartlett 
ree factors at most would be sufficient 


(Table III. From the figures it will be seen that thi 

to account for all that is significant in the correlation structure. Further, on examining the 

pattern ! presented by the factor-saturations, it becomes clear that the four bipolar factors 

can represent little else than the results of forcing specific factors into the form of common 

factors. This can best be shown if we reconstruct a smoothed correlation matrix from the 
Pearson's method with unity 


saturations for the first factor alone, and then factorize it by 
ison in Table II B, where the artificial nature 


in the diagonal : the results are given for compari 
In Table II A there is certainly a suggestion of 


of the resulting factor pattern is very plain. 

ing F. B. and A. M. with the other three subjects ; 
f the specific factors, its significance 
Our investigator can therefore, with little or no hesitation, 
nt factor with positive saturations throughout, but that is 
t once the most economical and 
licable as the 


ve sets of correlated assessments are exp 
5.2 Only if this factor-structure is 


found unavoidably to leave residuals that are definitely significant shall we be warranted in 
factors in addition. 
fit His next step, therefore, will be to test his hypothesis by finding how closely he can 
fit is data with saturations deduced from it. To find the saturations yielding the closest 
$ed e will again use the method of weighted summation, but will now insert appropriately 
educed self-correlations into the diagonal of the correlation matrix. 
en ud he principle of economy demands that he should begin by basing hisestimated self-correlations 
em сїр Ob a single common factor first of all, and only introduce further ee when SE 
well Sith S liges him to multiply the entities he is postulating. Table I can ba itte TRN y 
signific one common factor alone. The largest residual is —0:088 ; and this can p a P 
with ance. Had the first set of residuals been statistically significant (as it might easily have been 
ith a larger group), then he would have to augment his self-correlations accordingly, 
re probable errors are fairly larg 


his calculati i 
; ons afresh. 1 like the present, whe 
E DC И р i d of weighted. On the other hand, if he 


In actual j 0 "th simple summation instea 
practice be content pilh ШШЕ chan (e first ‘ correct for unreliability ° (thus taking а 
\ , 


Wished to adopt а more rigorous (ement, e e i pally the 
matri ү J1 e NAT ga Were un v AC tn te former case ru 
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(and only Eator’s ultimate aim, however is to estimate the appro d on internal evidence f 
S far e eee ш factor, This factor р ravides n ns E em erion evidence of the balance, 
Sach SS the * saturation, ted dicients * f or thy SMO as estimates of the probable correlation 
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for Dur | 
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Five methods of analysis have been employed. (i) Weighted summation, with self-correlations 
of unity (W.S.u.s.) ; this, it will be remembered, adopts as the internal criterion Pearson's line of 
closest fit. (ii) Simple summation, with self-correlations of unity (S.S.i.s.) ; here the criterion is 
the straight unweighted average of the standardized assessments. (iii) Weighted summation with 
correction for specificity (W.S.c.s.). (iv) Weighted summation with reduced self-correlations, but 
aout correction for specificity (W.S.r.s.). (v) Simple summation with reduced self-correlations 

S.S. r.s.). 


TABLE IV. AGREEMENT OF INDIVIDUAL ASSESSMENTS WITH CRITERION 


(a) With (b) With Internal Criterion 
Persons External (Saturation Coefficients) 
Criterion (i) Gi) (iii) (iv) (у) 
W.S.u.s. S.S.u.s. W.S.c.s. WSIS: S.S.r.s. 
GOR. s -841 ‘861 +829 -818 +838 :863 
Вз yr 790 “851 -801 -805 -810 | 815 
А.М. và 634 “753 712 -669 -642 -625 
е: 5. -402 -556 -587 427 414 +399 
W. R. i$ +196 -320 464 :238 :221 214 
Discrepancy 000 107 149 -030 *015 018 
(Root-mean- 
square) 


TABLE V. WEIGHTING COEFFICIENTS (PARTIAL REGRESSIONS) 


(a) Based on (b) Based on Internal Criterion 
Persons External (i) (i) (iii) (iv) (у) 
Criterion W.S.u.s. S.S.u.s. W.S.c.s. W.S.r.s. S.S.r.s. 
GER  ,, 534 353 295 421 480 532 
BIB о, 346 350 295 378 400 403 
А. М. 154 308 295 204 141 10 
J. C. H. -040 228 295 081 048 022 
W. R. — 002 130 295 048 029 01 
Correlation 
(i) Main Set 914 :885 844 ۰907 910 208 
(ii) Second Set "862 "781 "837 885 -902 87 
T a 
Let us first. enquire how far the hypothetical correlations with the internal criterion pu 
trustworthy indication of the judges’ observed correlations with the external criterion, і.е, mean- 


weights. The closeness of the approximation can best be assessed by computing the root: es are 
square of the several discrepancies (see last line of Table IV). It is plain that the nearest ПЕШ ose 
the saturations furnished by the method of weighted summation with reduced self-correlations., те iS 
found by taking self-correlations of unity are decidedly inaccurate : the effect of this procedu 
to give a deceptively high figure for the accuracy of the poorest judges (especially J. C. H. and W- 
The worst figures of all are those furnished by simple summation with self-correlations of unity. © 
Estimation of the Criterion by a Weighted Combination of the Judges’ Assessments. E 
we have seen, neither the assessments of the most accurate judge nor yet a simple es of 
of all the assessments can be expected to yield the best available estimates for the Ve he 
theseveralboxes. Our problem now is to enquire how far such estimates are improve ights 
different methods of weighting that are available, For this purpose the necessary t nula 
or loadings will be calculated from the correlations or saturations by the usua for 
for partial regressions. the 
If we already knew the true values for each box, we could calculate the regressions fro mn 
observed correlations : the equation to use would be w = rR", where r stands for the rst dings 50 
in Table IV and R the correlation matrix in Table I with unity in the diagonal. The load! 
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obtained are set out in the first column of Table V. The figures indicate that A. M.'s judgments 
deserve less than half the weight of F. B.'s and little more than a quarter of that of L. T. R.’s, while 
the assessments of J. C. H. and W. R. have no significance at all. In general, of course, the regressions 
obtained in this way from a direct comparison with the true values will furnish the highest multiple 
correlation. Here, it will be seen, the multiple correlation amounts to 0:918. 

Now, as I ventured to suggest in my earlier papers (4, cf. 10 and refs.), even where we have no 
external criterion we can still apply the principles of multiple correlation to determine measurements 
for our so-called ‘factors.’ The regressions will be calculated by a similar formula (w' af RE 
where f’ will now denote any of the last five columns of Table IV). And generally, to determine the 
relations between the internal criteria (P) and an external criterion (c), we may use the same weights 
as those I have elsewhere suggested for computing the correlations between a set of factors and 
any external variable: viz., (in the notation of my previous papers) W = F^ (if F is non-singular), 
or F' R7! (if it is not), i.e., ZAF’ (if weighted summation has been used), or yf’ (if only one factor is 
required). To transform F to rme, we adopt the usual type of post-multiplier, viz., Erme = ЕМс = 
Pc’ = rp, making the same substitutions as before if F is non-singular (cf. 10 and Appendix, p. 124). 
The weighting coefficients thus obtained are shown in the corresponding columns of Table V. The 
correlations with the true values, calculated by using these weights, are appended in the last line 
but one. Confirmatory validation, as usual, has been sought by trying the same weighting 
equations with a second sample : see last line of table. 


It is clear that simple averaging (5 ii) yields the least satisfactory result. As we have 
seen, it gives equal weights to all judges. Even the Pearsonian method for finding the 
“line of closest fit’ (b i) tends in the same direction. On the other hand, the methods of 
simple and weighted summation with reduced self-correlations (b iii, iv, and v) here supply 
almost as close an agreement with the true values as was obtained from the partial regressions 
based on the true values themselves. In the present case the differences between the last 
three methods are too slight for any generalization to be deduced from the figures alone. 


Supplementary Experiments. To show that these results are in no way exceptional, 
but hold good in widely different fields, I append the figures calculated by similar methods 


from six other enquiries. Е 

The tasks were (1) to judge the weights of boxes (an experiment identical with the above but 
carried out with adults) ; (2) to judge the length of lines ; (3) to judge the areas of rectangular figures ; 
4) to judge the length of time-intervals ; (5) to judge the number of spots on cards shown tachisto- 
Scopically ; (6) to guess the heights of familiar buildings or monuments. In the first and last of these 
experiments adults (students) acted as judges ; in the other three, children aged 12 to 14. \ Table VI 
shows the final correlations between the ‘ true values ’ and the estimates given by various linear 


combinations of the assessments made by all the judges.* 


TABLE VI. CORRELATIONS BETWEEN JUDGMENTS AND TRUE VALUES 


Й le Based (b) Based on Internal Criterion 
e No Se | = © Gi) ii) v) v) 
Material Items Judges | External И/.5.и.5.  $.$.и.5. W.S.cs. W.S.rs.  S.S.rs. 
Criterion 
Mew a | | m ox om S 
Amas. 24 8 668 623 605 545 584 491 
ime- 
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The Influence of Differential Weighting 


The results may be summarized as follows. Where the number of items is small, or where the 
number of judges is comparatively large, and particularly where all the judges have much the sama 
degree of efficiency, there is little to choose between the various methods of weighting, and not S 
is lost if no weighting whatever is employed. But where the number of items is large enous! is 
reduce the probable error to small dimensions, and particularly where the number of ведер. 
to be averaged is small, there the use of an accurate method of weighting is plainly of Эра 
importance. In such cases the best procedure would seem to be the method of least squares (weigl 
summation) with reduced self-correlations. The method of simple summation usually yields MM 
Similar results. There is one exception—the experiment with areas. Here (as the introspect 
show) at least one large supplementary factor is almost certainly operative (representing subjec ot 
differences in mode of assessment). With so few cases the residuals for such a factor were Ex 
statistically significant, and tended to be somewhat erratic. However, if a second factor t 
allowed for in the self-correlations, these would approach nearer to unity. Hence the арр ity 
exception is in keeping with the general rule. It may seem surprising that the correction for speci he 
does not (except in one dubious case) improve the resulting figures. That is doubtless due to 
fact that the external criterion is not absolutely identical with the internal. m. 

Is Weighting Really Necessary? In discussing the weighting of test items for. vilium 
Guilford apparently concludes that “ what few results we have point definitely to the іа 
that such weighting is not worth the trouble " (loc. cit., p. 448). Can we then lay dM 
criterion for deciding when, if ever, such calculations are really worth while? I bore 
suggest, as a sound practical rule, that differential weighting may be regarded as er y 
when the correlation between the weighted and unweighted assessments is significa 
larger than the reliability coefficient for the weighted assessments. 


: :ehting 2 
form, (c) equal weighting as the simplest form, and (d) differential weighting 
determined empirically by various factorial procedures. value 


itiple 


observations), a priori weighting is generally worthless. both by 


Soy, : : dard 
random weighting (such as tends to be produced by accidental differences in stan deli 


zi 3 Pei ; b 
deviations) and by equalized weighting (where such accidental differences pese 
eliminated). The formulz indicate that the validity coefficients so obtained dati ons 


(i) with the number of weighted traits and (ii) with the size of the inter-corre tial 


between them. It is concluded that the choice of the best possible set of differe у 
weights is of special importance when the traits are few in number and differ 
in their diagnostic value. well a$ 
4. A series of illustrative experiments is described in which an external as terna 
an internal criterion was available. The results indicate that, when an on alysis 
criterion is not available, the best weights will be obtained with a factor method 
carried out, not by the classical method giving the ‘lines of closest fit’ (i.e., the mation 
of principal components with self-correlations of unity), but by weighted sum depart 
With reduced self-correlations. Where the correlation matrix does not 


122 


osite - 


8 


uy 


CYRIL BURT 


appreciably from a rank of y 
oi one, the further corrections for specificity or reliabilit 

roduce no discernible improvement, and simple summation gives almost d 

int as good a 


ult t weighted summation. 
. More generally it has been shown (i) that any method of weighting will yield 


a ' factor ’ in the accepted se 

FA nse of the term, and tha i 

RE DOC ше ШЕ of the matrix of dence e Tn 
Eun which SN e Jesds! o sid bir edm weighing 
onvenient working procedures. One such pee if the кынкы 
ds i s. Ones system is obtained if t i 

cdit me d equal, but differ in sign-patterns for аде уар аа 

S weighting á eme are merely summed algebraically as they stand. "This is 

аек Аг єр in analysis of variance. When the number of traits to be 
te When the a, is type of weighting does not yield altogether satisfactory results ; 

number is large, it yields a good approximation to the best result, 


parti 
asl for the first or ‘ general’ factor. 
better pre ae coefficients deduced from a particular sample necessarily give 
Predictors is one for that sample than for any other, especially when the number of 
Y application E Hence an empirical weighting equation should always be checked 
On with re eet nd experimental group. When this is done, weighted summa- 
on teon I f-correlations again seems to furnish the most stable results. 
Pi igo has been suggested, based on the reliability of the composite 
ng. e when differential weighting should be gubstituted for equal 
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The Influence of Differential Weighting 


APPENDIX 


1. In the text (p. 108) I have contended that the real or operative weight of any component 

is to be measured (as in factor analysis) by its contribution to the total variance. Thus, i 
y = BK +... + В„Х„ = Biou +... + BnGoXni = Оху +... + Unxn ; 
where B; = the assigned weight and U; = Bjc;— the actual weight, then 
S? = В,с,УВсин + ... + BaonSBisirnt 

= U, X Uru +... + Un Urns: А 
so that the contribution of j depends оп (i) Bj, (ii) oy, and (iii) the ry’s. If the components are T 
standard measurement and uncorrelated, then o? = U,? +... + Un, i.e., the contribution of eac 
component varies with the square of its weight. If the components are correlated, and given equa 
weights of unity, then су? = Уму... 4 Dry =ry +... + ту, be, the contribution 9 
each component varies with its correlation (or * saturation °) with the composite measurement, X 
But with differential weighting the saturations will not be proportional to the regressions ; ave 
sometimes; as with ‘ suppressor ’ variables, a test with a positive saturation may require a negati 
weight. 

2. Where the final criterion is external to the battery, the best method of weighting i5 n 
described in the theory of multiple correlation. Where no such criterion is available, theni 5 
stated above) the same kind of proof leads to the formulz for factor analysis by weighted summa d 
But, it may be asked, when both are available, how are we to determine the relation between 

wo? 

There are two ways in which the relation between a factor and an external criterion miy бе 
assessed. We may correlate either (i) the factor-measurements (p) with the criterion measures 
(0), or (ii) the factor-saturations ( f*) with the correlations between the criterion and the ser 
assessments (rme). The former is in my view the proper procedure ; the latter provides a conveni 
short cut, but has sometimes been adopted without a real understanding of its ітрісайоп pe 

(i) If p is in unitary etandard measure, then the correlation between the measurements Me 
Гре = pe’ = WMC = VIS tme = VY rme = (Ff) ‘rne. But if p consists of estimated e by 
ments, then its variance will be wf = Rf; and this is only unity when f has been derive 
factorizing the correlation table with self-correlations of unity. Hence the formula used in 
text iS ме + A/ wf. i (rn) 

(ii) The “unadjusted correlation’ between the saturations and the criterion correlations я р 
will be me ==, = Vf fme, if we take nerme as = f'f approx. It will be seen that, W! 

МРГ) (тетте) when the 
this assumption, the two methods of calculation lead to the same result. Moreover, rtional. 
“unadjusted correlation’ = 1, then the saturations and criterion correlations are ргоро тегеп 
If, however, we desire to compare several sets of saturations (e.g., sets obtained by h this is 
procedures), then it would seem better to use the square-sum of the differences, thoug! 
again closely related to the unadjusted correlation. И riginal 

3. The full proof of the equations (iii) and (iv) given on р. 111 was given in mY reader 
(unpublished) memorandum (cf. 10, p. 246): though lengthy, it is straightforward. Ti may be 
will find no difficulty in reconstructing the argument for himself. The essential steps 
summarized as follows. > tween the 

Using the notation suggested on p. 111 consider the product-moment correlation be Sere 

, 


weighted sums, y and z, defined by equations (i) and (ii) ; write the correlation ry: = Du De 


N and D.D» denote the numerator and denominator respectively. 
Then, since the measurements are in unitary standard measure, 
n(n-1) 


n 
N= УШИ + X Ши. 
2 tj 
Putting U; = U + шапа Wi = W + wi, we have 


n 
XUuWi-nUMW--Yuw. K 
; lations with 
Now U: and Wi, having been chosen at random, must have only non-significant or ai vergences 
each other and with the values they are used to weight. Further, unless there are wide r denotes 
among the inter-correlations, we may take Y; U: Wj ri; = r X; U, W; approximately (where ^ 
n(n-1) NAP. » ima ia) 
the average value of ry, i + j). Hence ‚>, UWirg = n(n — 1) U Wr + F Xi ш wu approx? 
i$j 


Е ee 10: 
1 For the formule for calculating the correlation between factors and external variables, 9 
also J. Psychol., VI, 1938, p. 355, eq. (vii)- (ix). 
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But, fo 
, for the same reason as before, У) uv must tend to zero. Hence 
E niea TD. М= п UW (1 + п — 17) approximately. 
. Now; ie e T a + (ı — 1) 7) + (1 — 9 92) with a corresponding expression for Dw. 
orrelation ry. depends solely on the relative proportions of the weights ;_hence it 


Will be unaff i 
Since by h ected if we multiply the first set of weights by some constant, so th = 
ypothesis cy? = oz" = 1, we have си? = c^; and therefore , so te UT He e 


— п? (1 +л—10),_ 
n (O 4n — 17) + @ SEDES 
nr+0 -0 
arta- ED 


Divid; 
lviding out we obtain 


P 1-7 94 D 
Tuo = 1— Un -7e + terms in (2) and higher powers. 


If we 
г i A p E 
DD, can ры retain the original values for the weights, so that U + ЎЎ, and ou + ox, then 
before e ored by completing the square *; and we reach the same general expressions as 
Е pe that for Es we substitute 3 Uv + yr" 
€ Su А 3 4 " Р a 
B ШИЛ) } theta that the inter-correlations are too widely divergent to substitute 7 X; UW; for 
is merely to q gebra becomes somewhat heavier ; but it is not difficult to show that the effect 
o dimini: A 
iminish the value of the negative term, so that it tend 


reached by the ust of matrix notation. The 
r-measurements, and the vectors of weights, 


s towards 77 - My ty 


4 
onse. The Я j 
ollowin other equations given in the text are readily 


Y, B re: 
v and w, mer be noted. Ify апа z represent facto! 
‚ are based on factor-saturations, fy and fz, then 
wy RW fi ROME 
МУК. Raf 


та = oo ES 
Vy Коюу. We’ R Ws 


fw 
v has be А ; 
en obtained by a Pearsonian analysis, гу: »z is standardized. 


Eh =f ws , assuming W 
» however w, = [1,1 И ЯН Ш г 
‚1,...1] = wo (in the usual notation), then г = Vw, R Wo 
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NOTE ON SIR CYRIL BURT'S PAPER ON k 
Б DIFFERENTIAL WEIGHTING | 


By GODFREY THOMSON 
Moray House, University of Edinburgh 


I fear the preceding paper may be misleading unless it is read with the following warning а 
in mind. Iam thinking more particularly of the experiments with lifted weights, etc, I 
which it was found that the method of weighted summation with reduced self-correlations 
provides almost as good an agreement with the true weights as would be obtained [ош 
the partial regressions based on those true weights ; and the method of simple summation 
provides estimates that are almost as good (pp. 121, 123). The conclusion implicitly ауы 
by the writer is that * reduced self-correlations,’ which most of us call * communalities, ds 
justifiably used in analyses of psychological tests—a conclusion about which I am n 
sceptical. In the experiment with lifted weights we know that the correlations are еше 
due to one factor, the weight, which is the only thing common to the five persons ee o 
the judgments. In that case, it seems to me obvious that it is proper to reduce the тап Lo 
the matrix, by the insertion of communalities, until it is of rank one, or as nearly rank fat 4: 
as the inevitable blemishes will permit. Burts experimental confirmation of this is in es 
esting, but the result seems to me to be a foregone conclusion. To reverse this argum 
however, seems to me go be dangerous. In a psychological set of correlations betwee a 
tests (and the persons in Burt’s experiments correspond to tests in the usual psychologies x 
analysis, while the weights in his experiment correspond to the children in the usual Prat 
logical experiment), even when the rank can be reduced by communalities to one, have 
know that there is only one factor causing the correlations. The correlations may sich 
been caused in many other ways, and in particular might have been caused by en enes, 
call upon random samples of a large number of qualities whether inherited or acquire | P: the 
bonds, pieces of information, fragments of skill, what you will). Itis true that hien. 
rank can be reduced to one, the tests have acted “ as if ” they had only one factor in conrad it 
together with a large specific factor in each test, but this is not necessarily the ИСЕП js to 
seems to me that to insert the minimum possible communalities in the diagonal ce t has a 
make the assumption at once that there is only one common factor and that each tes AE 
very large specific factor, and to make this assumption solely on the mathematical gro con- 
that the low rank can be attained, or very nearly attained, without any psychologica 
sideration whether these large, these enormously large, specific factors actually exist. know 

It seems to me that we are only justified in using minimum communalities if (а) ier 5 
that the correlations are only due to a certain small number of common сараен ofa 
the case in Burt's experiment, or (b) we think it worth while to make the assumpti sycho" 
similar number of hypothetical common causes for convenience of thinking. " ра прег 
logical experiments we can seldom, if ever, know that there is actually only a sma were 
of common causes acting. It may be worth while to speak and to act “ as if Pi to have 
case, but what I am afraid of is that these purely hypothetical entities may be he B 
a proved existence in a much more definite sense, sometimes even in a physical sense 
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d A REPLY TO SIR GODFREY THOMSON’S NOTE 


aad І im most grateful to Sir Godfrey Thomson for his comments on my paper. With his warning, 
a me h the gist of his final conclusion, I am, I think, in general agreement. He puts his finger 
justifi 2 to my mind is the most vexed question at the present moment, namely, are psychologists 
fa fied in substituting * reduced correlations for unity in the diagonal of the matrix to be 
ctorized? He thinks this change unjustifiable. Presumably he would revert to Pearson’s 
Е procedure—though he does not explicitly say so, nor can I recollect any factorial investigation where 
€ has himself adopted that method. 
ч foll Space does not allow me to consi 
, = following brief comments. 
| a e 1. In the foregoing paper I was only concerned to compare methods of weighting, not to adduce 
t idence for the assumption of “ one factor causing the correlations. I have nowhere attempted. 
р prove such a conclusion by “ reversing the argument "1 апа I fully agree with Thomson that it 
s uld not be done that way. If I did not insert a more emphatic caution on this point, that was 
1 imply because I had dealt with this particular criticism in the last number of this Journal. 
2. My reduction of the * self-correlations ' is based not, as Thomson seems to suppose, on 
Considerations of rank, but on considerations of significance. (I avoided the term * communality " 
Just because that has become associated with the idea of reducing rank.) My general grounds for 


preferring reduced self-correlations to self-correlations of unity have been given in earlier papers, 
and were summarized on p. 113. With these grounds, however, Thomson does not deal. The 
Object of ** the experiments with lifted weights, etc.," Was, as stated in the opening sentences of 
SD te the effects of different types of weighting ° where an objective 
Criterion was available, not to justify the reduction of self-correlations or communalities as a general 
Principle. "That would have been far too wide a question to re-examine in an article dealing with 


der his arguments in detail. But I should like to make the 


* in the experiment. with lifted weights we know that the 

Correlations are entirely due to one factor." With this, I fear, I canno: agree. On the contrary, 

r netrongly suspect that the correlations are due to more than one common factor. My point was, 

[| Ot that no further factors existed, but that, in the experiment described, the additional factors were 

90 small to be statistically significant ; and in such a case, so I should maintain, all we can take for 
Stanted is the existence of the one common factor whi shed as fully significant. 

4. Nor can I agree that this result was “ a foregone conclusion.” In fact the outcome of the 


Experiment on judging areas seems to show that it was not. Here there were other factors. 
th n the margin allowed by the probable errors) 
е 


5. At {һе sc ime. in thus assuming that (withi 
$ correlations can ROM. for by means of only one common factor, I am not for one moment 
f Pposing that this single factor is therefore simple. The fact that, as Thomson suggests, a general 
Actor like intelligence may be biologically due to a large number of genes, OF neurologically due to 
arge number of neural bonds, does not appear to me to be in the least incompatible with its. 
бер ion as a single independent factor. Sweden may be a single independent State though it 
ists of over six million persons. ; “ 
«a. 6, TI e а at in-cases such as I have described the tests have acted 
BS if? E pta bdo e With this form of expression I should heartily agree. 
Ut then, surely every empirical generalization in any and every science can state no more than this, 
9r example, in mechanics, chemistry, dynamics observable changes take place ^as if 
Derey ws Cag but does anyone suppose that energy and similar " purely hypothetical 
^ e definite sense 


entities ” h, but, 1 ? 
ave a “ proved existence in any mon Ad i А 

Te Thomson like other critics, may have misunderstood my IS разни in previous 
Papers, on the inclusion of an external criterion (such as the teacher s ES us general 
futclligence *) and on the calculation of the correlation between this guise an с BA 
Or-measurements Some have apparently inferred that the object of such calculations Was 

ted T itici dvanced in his most recent 
The T: Kah 5 152. With the criticisms advan en 

wo- Journal, ЇЇ, p. 152. 0 
vine (8) Ho Factor Theory, tin Brit. J. Sociol, I, pp. 162.4 „Му agree oh between- factors ut 
i implied i c ated the 4 een “i à 

ш P istital comi implied ip the text, where rete entities 3. «a confusion ” (1 added) “ against which 

i 2 fessor Thomson has so rightly and so often protested ” (p. 106) : 
ope? Section on ‘ The Reality of Factors. If I am aske! aid that the hypothesis is based onem. 


Сопу; 


y or verifying that hypothesis. А 
E is now t hion to say, ' hypo T 1 ( 
Y үпте). Timon gather, is inclined to reject the inference : as he has 
Бер Dot necessary to postulate the existence 

Ver in the existence of a faculty called * general 
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i 3. Thomson goes on to say that 


and thermo: 
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Reply to Sir Godfrey Thomson 


literally to identify the general factor with a * known common cause (as Thomson calls it); and 
they have accordingly asked : why not go further, and maximize the correlation of your factor with 
that criterion? My reply ! would be that, with a properly planned experiment, the general or basic 
factor should be far more trustworthy than the external criterion. The calculation of the correlation 
between the two was intended to throw light on both—to correct the empirical criterion as well as 
to check the interpretation and labelling of the factor. Thus, in the case of * general ability or 
* intelligence’ (which, I take it, is the kind of “hypothetical common cause’ Thomson has more 
especially in mind) I should regard the factor-measurements derived from *a psychological set of 
correlations between tests’ as furnishing better estimates than the ‘criterion’ provided by the 
teacher, though his assessments might at first sight seem more objective. May I add that in my 
view the * assumption of such a hypothetical common cause ° is made, not merely * for convenience 
of thinking,’ but also for the convenience of practical work—e.g., allocation to schools or predictions 
of future performances ? 
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ERRATA 


l. On p. 29 of the previous issue (Vol III, Part 1) in Table IV A, col. 3 (bipolar factor Ш), the 4 
signs for the first 13 traits should be negative, and for the next 7 positive (the division thus correspond- 
ing with the division between group factors E and G); the next 9 should also be positive, and the 
last 9 negative (to agree with the allocation of signs adopted on p. 32). 


. 2 On p. 67 in Table XVIII B, the figures printed in the first two rows and columns correspond 
with an alternative division of the first four tests (namely, into 1-2 and 3-4). To correspond with 


ET division adopted in XVIII A and C (into 1 and 2-4), Table XVIII B should therefore read as 
OlHOWS : 


1 2-4 5-7 ` 8-10 , 
1 | -7805 1:5695 1:3995 1:5105 1 
2-4 1-5695 3-4206 2-7605 2:9795 
5-7] 1-3995 2-7605 3-1996 2:7200 
8-10 1-5105 2:9795 2:7200 4:9601 


1I should also answer that, as a rule, it would be better to maximize the correlation of the criterion [ 
with the fests than with the factor (i.e., to use ordinary multiple correlation) : but here I ш @ 
concerned only with aims, not with methods. І тау add that, when I first suggested the десс An 
of correlating internal factors with external criteria, I used two alternative criteria : the object we Ae 
to see with which 9f these two abilities the general factor could best be identified. And J went on | 
emphasize that the factor itself, ** though single is not simple, but complex " (4, 1917, p. 55). 
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THE VALIDATION OF SEASHORE’S MEASURES OF 
MUSICAL TALENT BY FACTORIAL METHODS 


By JOHN McLEISH 
Extra-Mural Department, University of Leeds 


I. Problem. II. Subjects and Tests. Il. Results : (a) Factor Analysis of the Seashore 
Battery ; (b) Multiple Correlations with External Criteria. IV. Summary and 
Conclusions. 


I. PROBLEM 


The object of the research described in the following paper was to attempt to 
validate the Seashore tests of musical ability by a factorial procedure, using both 
internal and external criteria. The need for both kinds of validation appears to be 
now generally admitted in the more familiar fields of intelligence and educational 
trang. But it has been almost wholly ignored in the construction of tests for musical 
ability. 

The problem of scientific validation is one of the most difficult with which the 
constructor of any kind of test has to deal. The commonest practice is to take some 
external criterion with which the test-results are then compared. This, however, is 
logically unsatisfactory, since it is just because external evidence is tacitly assumed 
to be unreliable that we have undertaken to construct a standardized test. The 
argument on which the validation is based is consequently circular: a criterion, 
admittedly imperfect, is used to demonstrate the worth of the test which is to replace 
it. The use of multiple criteria does not altogether release us from the logical dilemma 
to which we are thus reduced, for the multiplication of dubious or biased testimony 
does not necessarily make the conclusion more certain. 


The argument may perhaps best be illustrated from the field of intelligence testing. In his earlier 
work, Spearman used as his criterion the ratings for intelligence submitted by teachers. Не believed 
that the unreliability of their judgments could be eliminated by securing multiple criteria, i.e., ratings 
from two or more teachers for the same children, and then correcting the raw correlation between 
the average judgments of the teacher and the various tests he used. But, as Burt showed in his 1909 
article,! the judgments of most teachers have a bias towards verbal ability and ability to learn ; he 
therefore proposed instead to use an infernal criterion in the shape of the * highest common factor * 
underlying the whole table of correlations between the tests selected according to a psychological 
plan. At the same time, in deciding the nature of the internal factor so found, as he went on to insist, 
it was not sufficient to trust to the names of the tests or to subjective notions of what they are intended 
to test ; an objective check, based on improved external evidence, is always desirable. The process 
is then, not a vicious circle, but rather a spiral of successive approximation. 

The danger of omitting this double check becomes especially clear in the endeavours made from 
time to time to validate tests of musical ability, such as the Seashore battery (9). Dr. Stanton's 
“Eastman experiment ’ is perhaps the most comprehensive and painstaking attempt at validation of 
any musical test so far carried out. The methods used in that experiment (11) and in other investiga- 
tions by the same author (12, 13) included retests of students of music after training, comparison of 
their scores in the tests with their teachers’ ratings, the development of a cumulative * key for 
Prognosis’ by empirical studies, a comparison of the gradings furnished by this key with all other 
data available about the students—their class-marks, recitals, dismissals, etc. On the basis of the 
results, Dr. Stanton concludes that the * cumulative key ' (which included both the Seashore records 
and an intelligence test) forms a “ useful administrative tool for the selection and classification of 
Students ; . . . the tests, which became the basis of classification, can do quickly what the faculty could 
learn only after long observation and experience with the students (11). 

At the same time she fully recognizes that the manifestation of musical talent depends on many 
Circumstances apart from the possession of the innate ability (or set of abilities) which the Seashore 


s Experimental Tests of General Intelligence,” Brit. J. Psychol., II (1909), pp. 168f. 
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tests were designed to measure. Among the more obvious of the extraneous influences that play an 
important part in determining the level of performance or appreciation, she enumerates the following : 
** poor health, insufficient finances, lack of ambition or effort, family demands, personality problems, 
religious conflicts, * inferiority complexes,’ insufficient emotional poise, misdirected goals, disinte- 
grated concepts of prerogatives [sic], incoherent plans, unfavourable rapport between teacher and 
pupil, poor musical training, inadequate musical preparation for the educational level attempted, 
extreme personal jealousies, marriage, over-anxious parents, exploitation by patrons, teachers, Or 
parents" (11, p. 131). Such a list is, in effect, a list of flaws in the normal validating procedure. 


The investigation reported in the following pages should be regarded as a 
supplement to Dr. Stanton's investigations rather than a criticism of them. Its chief 
aim was to make good the more obvious shortcomings of previous enquiries, namely, 


the failure to apply available statistical techniques and in particular the techniques of 
factor analysis. 


„Тһе earliest mention of ‘factors’ underlying musical ability, which is based on any formal 
statistical procedure, appears to be that contained in the Board of Education's Report on Psychological 
Tests of Educable Capacity (introductory chapter, prepared at the request of the Committee by Burt, 
p.20); and in his survey of group factors, published a year or two later (2), he mentions “a distinct 
musical capacity, no doubt highly complex," as definitely established. The data on which this 
conclusion was based were obtained from experiments carried out by Burt and Pelling for purposes 
of educational and vocational guidance, partly in L.C.C. schools and partly at the National Institute 
of Industrial Psychology. The novel feature in the approach consists in the addition of a number of 
more complex test-items, intended to measure “ the Gestalt-qualities of musical understanding, as 
contrasted with the atomistic approach adopted by Seashore and most other investigators hitherto." 
Susmo were later made by a succession of research-students, working at the Psychological 
I рог ову, University College (17 and refs.), to determine the more important components of this 

РИ complex capacity : the factors found included elementary perceptual abilities both auditory 
and kinesthetic, auditory and kirzesthetic imagery, a broad factor for zsthetic appreciation generally, 
a more specialized factor for musical appreciation as such (largely influenced, it would seem, both by 
musical experience and by temperamental inclinations), and finally general intelligence.* "The last 
and the most recent of these researches was that carried out by Wing (18 : for fuller details reference 


оре made to his long and elaborate thesis (20), which also gives a fairly complete historica 


Several writers have publish inter- i i а ; 
BEG ario published the inter-correlations between the tests in the Seashore battery ; 


(4) have published figures which seem fairly ге ive ; i 19) 
) 1 presentative ; and Wing (18, 
E геш a factor analysis of the Seashore tests by Burt. With the exception of the last, 
attempted te Bee hoe Pate in its nature and limited in its scope, none of those who have 
the tables of correl ea үт! and reliability of the battery have taken the logical step of factorizing 
Lieu SE lons between its constituent tests. Instead, both Seashore's disciples an 
eem often content to rest their conclusions largely on anecdotes of personal impressions. 


pu EE] Ee to reconcile n views of American investigators like Stanton with the 
a 1 ike Wing, wı isti i i 
Dae ae ele us ie ving, o contends that the tests are atomistic, boring, musically 


The methods of factor analysis would seem to be pre-eminent! ! 

[ | c ] - ted for the 
ers Rt the logical difficulties which arise in studying such е, It is un- 
is so oftei los by that the objectivity which results from the use of such techniques 

h n lost by the way in which the factors are subsequently identified. This, 
т m aye aen often rests solely on subjective interpretation. Even those who 
ave claimed to demonstrate a factor of musical ability underlying their tests have 
either used no external check at all or else have been satisfied to compare the test- 
results with ratings obtained from teachers, whose opportunities for judging genuine 
musical ability are, as a rule, extremely limited : ability in playing the piano ОГ 
singing in tune is hardly a safe criterion. T жо p 


1 Burt, C., * The Psychology of Music’ (Gresha i 
> The, 9 Д > m Lectures, 1932. i i wing (18), 
p. 83) ; also id., * The Factorial Analysis of Emotional Traits,” Chae E O рр. 2R D. 


? 17, pp. 266f., and refs. The tests used by Burt i " f 
the more elementary processes discrimination and his collaborators consisted, not only of tests 0 


Е of pitch, rhythm, 1 a f apprecia" 
tion of melody, harmony, short musical pass; LA loudness, etc.—but also of apr n 
partly by the piano and partly by анори longer extracts, and ‘altered versions’ (give 


: b records). ЕЧ БЕП included; 
e.g., marking well-known tunes in order of preference i a es est 
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In the light of the criticisms expressed by Burt and his students—Wing, Williams, 
and others—it was felt that a new approach to the question of validation of the battery 
was needed. In particular, Seashore's assertions that intelligence and other non- 
musical factors had little or no influence on these measures of * elemental’ musical 
ability required to be examined in the light of experimental results. Part of the aim 
of the present investigation therefore was to attempt to establish whether the factors 
resulting from an analysis of the correlations between the several tests could be 
approached more objectively by comparing the factors with independent evidence 
regarding other musical and non-musical abilities of the persons tested. As a result 
it is claimed that a closer approximation to an objective interpretation has been 
achieved. 

The question that presents itself at the outset may be formulated as follows. 
What elements in the testing situation, apart from differences in musical abilities of 
the subjects, might conceivably affect the scores obtained? A preliminary study of 
the test battery and the relevant literature suggested that the following elements were 
probably important (8, 10) : 

(i) Age. In view of Seashore’s expressed opinion about his * measures of musical talent '— 
that pitch discrimination (8, p. 65), sense of time (p. 113), consonance (p. 157), tonal memory 
(p. 242), intensity (p. 99), and even the sense of rhythm (p. 126) are innate elemental capacities not 
affected to any considerable extent by training or environment—it is evidently desirable to ascertain 
the actual relations existing between the age of the subjects tested and their scores on the six tests 
in the battery. Seashore holds that these six capacities mature quickly in very early childhood ; 
so that by the time the child sufficiently understands the instructions it is possible to obtain a quantita- 
tive measurement of his natural capacities. On the other hand, Stanton and Koerth (12) have found 
marked differences between the performances of pre-adolescenty adolescent, and post-adolescent 
groups, which obviously throws doubt on Seashore's assertion. They explain the discrepancy as 
arising from the fact that, in the case of the adult, one is actually measuring the psycho-physical 
threshold, whereas with the child one is measuring the cognitive threshold. The former, they hold, 
cannot be measured until the age of maturity, which they place at sixteen years. Unfortunately 
they report no correlations between age and score for the various groups studied. 


(ii) Intelligence. Seashore, as we have seen, claims that intelligence has very little influence 
on the scores obtained with his battery, provided, of course, that the subject possesses the necessary 
minimum of intelligence required to obey the instructions (8, pp. 56, 98, 113, 157). On the other 
hand, Burt (quoted by Wing, 21, p. 342) found that “ when intelligence has been partialled out, the 
net correlations between the tests drops to little over 0:10, and are thus, with the groups we have 
tested, statistically insignificant." He concludes that most of the correlation between the tests, at 
any rate with children of the type he tested (namely, pupils at elementary schools), is due quite as 
much to intelligence as to innate musical ability. With older groups, more homogeneous in general 
intelligence, for example, children of a secondary (grammar) school type or students, the effect of 
intelligence proved to be somewhat less important. 

Gii) Speed of Judgment. On introspective grounds, speed of judgment would seem to be an 
influence that might have quite a considerable weight in affecting the scores. With the tests of 
Memory, Rhythm, and Consonance, although the stimuli are presented at a speed suitable for distinct 
hearing, nevertheless the test itself requires a definite decision : it is a matter, not of just listening, 
but also of making up one's mind about small points of difference ; and at the rate the stimuli are 
given mere quickness of decision would seem to be important. 

(iv) Interest and Boredom. As previously remarked, it has been suggested by Burt and Wing (21) 
that, for persons who are interested in music, the lack of a genuinely musical content renders the 
Seashore tests comparatively meaningless and boring. This, they say, becomes most conspicuous 
with the professional musician. Seashore noted much the same reaction in certain musicians he 
tested ; however, on retesting, after taking care to establish better rapport and eliminate the attitude 
of negativism, the difficulty, he claims, was overcome, and the correct psycho-physical level of the 
Subjects secured (10). д 

(v) Musical Training. Ѕеаѕһоге (8, 10) (and both Stanton and Koerth confirm his opinion 
(12, 13) ) claims that the scores of subjects are uninfluenced by the amount of musical training they 
have had before the test. According to Seashore, the tests are truly elemental, that is, “ so simple 
and natural that the resulting record does not vary with training " (8, p. 65). 


In view of these and other considerations, the following problems were formulated 
for experimental study :— 
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(a) Does the Seashore battery measure a unitary ability, such as would be manifested by a general 
factor, or does it measure different and independent aspects of hearing, as Seashore himself suggests ? 

(5) Assuming that the battery does measure a unitary ability, is it possible to say in what this 
consists? Alternatively, if there is a ‘ general factor,’ is it conceivable that it might represent not 
a unitary ability, but an aggregate of many common elements ? S 

(c) In the latter case, assuming that these common elements can be statistically eliminated 
e.g., by partial correlation, and that there is still a residuum left over and above these, can this 
residuum be identified with * musical talent’? 

(d) Lastly, since Wing has been the most consistent critic of the Seashore tests, how far do the 


Scores obtained with the Seashore battery by a given group of subjects agree with their scores for 
Wing's own tests ? 


It will be seen that this approach to the problem of validation is in direct contrast 
to Seashore's own attitude. His procedure can be described as basically ad hoc. He 
conceives of the * measures ’ primarily as a sieve for the elimination of the musically 
unfit, rather than as a quantitative instrument for measuring musical ability. His 
use of the psychogram in place of weighted or unweighted averages for describing 
individual endowment is the natural corollary of his view that the abilities measured 
are independent. There seems in fact, as we have already implied, to be a striking 
similarity between Seashore's conceptions and methods and those of intelligence 
testers of the pre-factor era. Seashore's supporters might perhaps retort that the 
later development of a mathematical superstructure has tended to make its employers 
lose sight of significant individual peculiarities behind a mass of numerical coeffi- 
cients. In Seashore's statistical methods one never forgets the individual; but, on 
the other hand, no clear psychological picture of the abilities tested is developed. 


t 


II. SUBJECTS AND TESTS 


The tests described below were applied to a group of 100 subjects, all under- 
graduate or postgraduate students of psychology. To ensure comparable conditions 
in the procedure, every test was administered and scored by the writer. The 
sre vents were treated as part of a normal laboratory course in experimental 
Psychology, and in this way maximum co-operation was ensured. The series of 
E was presented as a research project, the reasons for each variable in the 
ests and methods being explained in advance, so as to arouse interest, but always of 
course so as not to interfere with the proper application of the tests, As a result, 
interest and motivation were very high. The two or three students who at first 
Pee n n. den were afterwards found to be * tone-deaf’ or to have an 
exceptionally low degree of musical sensitivit In all, th i iabl 

А : . , the fol variables 
were included in the enquiry. á ; и 

1. Аве. 

2. Intelligence i i 
TM ie ig (measured by the Cattell Group Test ША, (a) timed (for ‘ speed 7), (b) untimed 

3. Speed of reaction. For the pur i i i 

р de. fas Л pose of measuring this variable, a c cellati st was used. 
A жоп He test, with an auditory stimulus, was impracticable with so Taree x ES 

Ке ie. vagn test oF musical discrimination. This was scored in two parts: (a) correct 
preference ‘or the original musical passage ; (b) recognition of the nature of the alterations made in 
the original piece (i.e., whether the alteration is of melody, harmony, or rhythm) 

, 5. Recognition of orchestral instruments. This test was included in order to ascertain the amount 
of informal musical training or interest possessed by each subject. It consisted of fifteen instruments 
playing in зоос a the Same passage from Schubert's Ave Maria, the instruments being recorded 
on a Wirek recorder in random order. The subjects were asked to name the instrument, or, failing 
this, to say to Shidi section of пе orchestra it belonged E be 

6. The Seashore measures of musical talent. These consi i iscriminati 
> А Л . ist ation, 
intensity, time. rhythm, consonance, and tonal memory (9), poner ape srt 
except that in the case of pitch, intensity, and time, when the subject could detect difference, he 
was asked to reply * equal,’ instead of guessing. dms 

- 7. Rating for interest or boredom.  Immediatel 


: i 1 ۴ ly after the applicati 2 i st, the 
subjects rated it for interest on a five-point scale (20). Interest Was eee St each та аа в 0 
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which attention could, be maintained throughout the test, and the various points on the scale were 
carefully defined. 

8. Wing's test of musical intelligence. This consists of the seven tests—chord analysis, pitch 
change, memory, rhythmic accent, harmony, intensity, phrasing— presented, like the Seashore battery, 
on gramophone records (21). Unfortunately, only half of the subjects completed all the seven tests 
in the battery ; the rest completed only the first five. Nearly all the subjects found some difficulty 
in sustaining attention, even though two sessions were allowed with rest pauses between. This 
perhaps is partly attributable to the recording, and partly to the use of the piano for every test. 

9. Questionnaire on musical interest and training. A questionnaire of the open or essay type 
(15) was freely adapted from the questionnaire used by Seashore (8, p. 17). The subjects were 
encouraged to write at length on their interests in music, and to be as specific as possible in mentioning 
composers and pieces. " 

It should be emphasized that the methods adopted were a compromise between 
the possible and the desirable. In particular, the cancellation test, the interest ratings, 
and the questionnaire were included only as rough trials ; and the conclusions base 


on them must be regarded as suggestive rather than conclusive. 


III. RESULTS 


In this paper only those results which have a bearing on the validity of the 
Seashore battery will be reported. In the calculation of data, all scores were first 
normalized and standardized. Calculations were taken to four places of decimals, 
but only two places are printed. " 

(a) Factor Analysis of the Seashore Battery. The correlations obtained are shown in 
Table I. With one non-significant exception they are all positive; and range from little 
over zero to 0-43. Except four (all correlations with Rhythm) every one is significant. 


TABLE L INTER-CORRELATIONS OF THE SEASHORE TESTS 
_-_—————————— 


Pitch Intensity Consonance Memory Time Rhythm 
Pitch .. "s (85) 39 27 42 :25 14 
Intensity 05 (89) “18 -22 -28 10 
Сопзопапсе.. `06 ‘06 (34 :43 24 —12 
Memory de 05 “06 05 (93) :23 43 
Time .. - 06 `06 -06 *06 C81) :03 
Rhythm da -06 07 -07 "05 07 (+82) 
Factor I T ۰62 48 “39 -87 42 :28 
Factor Il, 18 27 16 -42 26 —45 


Split-half reliabilities are shown in the diagonal cells, and probable errors below the diagonal. 


The coefficients are of the same order of magnitude as those reported by Brown (1); 
while the reliabilities are somewhat higher than those reported by Brown or Drake (1, 4). 
The higher reliabilities are perhaps attributable, at least in part, to the use of the reply * equals 


in cases of doubt. 

A factor analysis w 
Successive approximations fo: 
foot of the table. Since, in à 


as carried out by Burt's method of simple summation (3), using 
г the self-correlations. The saturations are appended at the 
ddition to the factor-saturations so obtained, i have Les 
for th iabilitv of each test, the results of the analysis may be expressed in terms o 
arte foucdacior Doy (3, p. 103 and refs). We have in fact factors of four different 
types : (i) a general factor, (ii) a bipolar factor, (iii) a number of specific factors peculiar to 


each test, and (iv) factors of error or unreliability. 
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(i) The General Factor. The general factor accounts for about 30 per cent. of the mm 
variance. The question of greatest importance is the psychological nature of this genera 
factor. 

As has already been indicated, the common practice is to interpret each factor bya TES 
scrutiny of the tests into which it enters. This practice has been sufficiently criticized by 
Burt in this Journal and elsewhere ; and would certainly be most inappropriate here. 
Fortunately we have been able to secure results from a number of other tests, musical and 
non-musical, selected, in accordance with the general plan of the research, as likely to throw 
light on the problem. Let us begin by comparing the saturations obtained by the six tests 
of the Seashore battery with their correlations with the various musical and non-musical 
tests. The result is a suggestive series of figures, set out in Tables II and III. 


TABLE H. SATURATIONS FOR FIRST FACTOR AND CORRELATIONS WITH 
MUSICAL TESTS 


Seasho! n Oreg O uestion- Orchestral 
Test е | Factor I Wing a oe Q sak етв 
Memory .. 87 68 67 73 эз 26 
РЕН. — .. 62 61 47 47 24 18 
Intensity 48 22 27 41 23 30 
Time .. : 42 25 34 23 :03 12 
Consonance . . 39 44 46 43 :36 14 
Rhythm 28 31 31 45 01 1 
Average. 51 e ` лр 45 20 20 


——————— کک چ ا ا‎ E” 


The Seashore tests have been rearranged according to their first factor-saturations. If 
we compare this order with the orders indi i i i 


provide a strong confirmation of the hypothesis that the 
ore battery is intimately related to musical capacity. 

onsonance (and the justification for this will emerge in the sequel), 
e questionnaire gives exactly the same order ; Wing's tests and the 


respondence. 
pare the order of the factor-saturations with that of the 


TABLE Ш. FIRST FACTOR-SATURATIONS AND THE NON-MUSICAL TESTS 


Seashore Cattell Cattell Cancella- 
Test Factor I (timed) (untimed) Age Interest tion 
o8 3g 9S S s у 
nn E * s . -02 B 9 
Intensity — .. 48 13 25 35 20 02 
Time .. sa f 42 13 1 4 У —:03 
Consonance . . 39 ۰03 id E 06 2 
Rhythm  .. | 28 17 : 0. + 
у! 19 —17 —-05 16 
Average as | SI 17 16 07 25 08 


The coefficients themselves are now much lower. There appears to be some relation, 
Presumably causal, between the six Seashore saturations and the ratings for * intelligence 
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(especially * timed 7) and for * interest’ ; but none with age or ‘cancellation.’ It would seem 
that speed at the low level measured by the cancellation test is not particularly important for 
any of the tests, but that speed at the higher levels measured by the Cattell test exercises an 
appreciable influence on the tests of Memory and Pitch. 

The conclusions to be drawn from Tables II and III may be summarized as follows. 
(i) The general factor in the Seashore battery is related to musical ability on the appreciation 
side (as measured by the Oregon test) and possibly on the interest and performance side (as 
measured by the questionnaire). It is similar to the ability measured by the Wing test. 
(ii) It is, however, also influenced by intelligence, particularly when the test employed takes 
the form of a speed test. 

The next step is to check the analysis by removing the influence of the timed intelligence 
score from the inter-correlations of the six Seashore tests. This was carried out in the usual 
way by the aid of Yule's formula for partial correlation. The results are shown in Table IV. 
A. comparison of Tables I and IV shows that, even after * timed intelligence’ has been 
removed, the pattern of relations remains much the same as before. 


TABLE IV. SECOND-ORDER CORRELATIONS : TIMED-INTELLIGENCE 
PARTIALLED OUT 


Test Pitch Intensity Consonance Memory Time Rhythm 
Pitch .. T — 336 27 36 23 10 
Intensity — .. :36 -— 18 18 26 `08 
Сопзопапсе.. 27 “18 — 43 24 —12 
Memory ae -36 18 43 o — 19 39 
Тїте.. ks 23 :26 -24 “19 — 01 
Rhythm t “10 08 = 39 ‘01 — 


(i) The Bipolar Factor. The bipolar factor can be more briefly dismissed. 


The saturations are hardly at all affected by partialling out intelligence. On comparing the 
saturations with the same external characteristics as before (see Table V), only in the case of age and 
the cancellation test can any appreciable relation be discerned. 


TABLE V. THE BIPOLAR FACTOR AND THE NON-MUSICAL TESTS 


Seashore a Cattell Cattell 6 Cancella- 

Test Factor П (timed) (untimed) Age Interest tion 
Rhythm — .. —-45 47 19 E —-05 16 
Memory —:42 32 18 —-05 34 15 
Consonance .. “16 03 “00 00 23 12 
Pitch .. a 18 22 319 ۰02 39 ۰00 
Time... " 26 13 15 17 06 —-03 
Intensity — .. 27 di 325 7:03 40 02 
Average* .. :29 T7 16 :07 25 08 


The bi olar factor is best interpreted as a factor of classification. It appears to divide the 
Seashore tesis into three subgroups : as will be seen from column 1, the first two tests have high 
-significant positive saturations, the last two much higher 


are probably quite significant. The first two comprise Rhythm and 


of Memory. Since the material seems SO obviously mean 
by musicians (21) as the best in the battery. However, 
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not altogether bear out this verdict: for, although Memory yields the highest figure, Rhythm has 
the lowest. The second group includes Pitch and Consonance. These are tests involving an 
immediate comparison of elementary stimuli, in both cases musical tones or combinations of tones. 
Next to Memory these two tests had the highest correlations with other musical tests (Table I Ds 
The third group includes Time and Intensity. Here the task again requires an immediate comparison 
of elementary stimuli—clicks in the one case and the sounds of a buzzer in the other. _ This group 
therefore appears to depend chiefly on stimuli of a non-musical nature. The results might perhaps 
be tentatively summed up by saying that the bipolar factor appears to contrast tests involving the 
immediate perception of a change with those involving the immediate memory of a change. 


TABLE VI. THE BIPOLAR FACTOR AND THE MUSICAL TESTS 


Seashore Oregon Oregon Orchestral E Question- 
Test Factor II (a) (b) Instruments Ning naire 
Rhythm — .. —45 31 45 18 31 01 
Memory |, —42 67 73 26 68 33 
Consonance .. 16 46 43 14 44 35 
OS бы, 18 47 47 18 61 24 
muera :26 34 23 12 25 03 
Intensity |. 27 27 4l 30 22 3 
Average — .. 29 42 45 20 42 20 


(Ш) Error Factors. The эго 
“error factors 7 is rather high. 
Nearly half of this is contribute 


` general feelings of pleasantness or unpleasantness, 
ions (6, 7). As Burt and others have pointed out, 


Ё у A a supplementary analysis was undertaken by examining the partial correla- 

tions obtained after intelligence, age, and performance in the oA test ka ben eliminated. 
owever, the elimination of the further variables appeared to leave the general pattern of relations 

: unnecessary to report the figures in detail. 

(iv) The Specific Factor. 


R - The factors discussed so far. namely, the general factor, the 
bipolar pn and the factors of unreliability, account for only 62 per cent. of the total 
Fonero Г е remaining 38 per cent, must therefore be explained by specific factors, i.e. БУ 
actors which are more or less peculiar to each of the six Seashore tests. 
ү решо ео may perhaps ? be inter 
in the tests, so that a comparison with tests of musica ili i i i se devise 
by Wing, should er consider ble light on Hae olig, ile p particular with the 
Wing's tests a second bipolar factor, almost as large as the fi i ; in this case 

i 4 first, was obtained ; and in this. 
both bipolar factors appeared to be fully Significant. Hence sily about 12 D. cent. of the variance 
* The objection of * atomism,’ urged by Burt and others agai t turn 
PRESA ainst t tt does no 
solely on the fact that the tests measure highly specific уыз 7T 3555 i they measure 
only the more elementary constituents (which, of course, tend also to be highly specific). Such pro- 
cesses, so these writers argue, are necessary but not sufficient. Their contention is that “ musica 
appreciation as such is concerned far more with the complex Gestalt-like qualities of musical composi 
tion than with the mere discrimination and memory of pitch, intensity, basic rhythms, etc.” 
, 
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in Wing's battery remains attributable to specific factors. Wing's battery, therefore, is not quite as 
atomistic as Seashore’s. But if, as most factorists seem agreed, musical ability is, in Burt’s phrase, 
“ highly complex," some degree of specificity or atomism would seem desirable. As has often been 
stated, “ the most economical type of test-battery is one in which all the tests have a high correlation 
with the general factor, but for the rest have a low correlation with each other.” 

Final Analysis. Опе of the aims of the present investigation was to compare the Seashore 
battery with that of Wing. In all the Wing tests used in the present research the piano is the 
source of the stimuli ; and passages or chords with some musical significance are employed. 
A factorial analysis of the scores obtained by the same group of subjects with the seven 
tests devised by the Wing battery shows that, with his tests, the general factor accounts 
for as much as 45 per cent. of the total variance ; and, as we shall see in a moment, it is 
possible, by an appropriate weighting of the Seashore tests, to secure a multiple correlation 
of 0:72 between the two batteries. It may therefore be concluded that Wing’s tests measure 
much the same kind of ability as Seashore’s, but measure it at a higher or at least a different 
level, namely, that of musical meaning. The results of the entire analysis for both batteries 


are shown in Table VII. 


TABLE УП, A COMPARISON OF THE FACTORIZATION OF THE WING AND 
SEASHORE TESTS 
The figures indicate the proportion (expressed as a percentage) 
contributed by each factor to the total variance. 


گے 


| General " 
Tests | Musical ылы Eo Specifics Unreliability 
| Ability " 
Wing "- 45 10 8 12 25 
Seashore hie | 29 10 0 38 23 


________ || me MM 


TABLE VIII. FINAL ANALYSIS OF THE SEASHORE BATTERY 
ڪس‎ SS e ۾‎ 


Factors 
Tests = 
| Gn) G(i) B Sp Е 
Memory = = ы alg 32 31 39 26 
Pitch .. Ке M" v aa "59 22 = 65 29 
Intensity ts s me x 46 “13 —:26 eT 33 
Time `.. d as Wa s -39 Ө = 9 
Consonance .. ss xA ХЕ 43 1 —. E : 
Rhythm.. ° "s n 83 27 17 52 -62 41 
1 
Percentage of Variance 25 4 10 38 23 


The final analysis of the Seashore tests is set out in Table VIII. The first four columns 
are based on the analysis of the partial correlations in Table IV. G(m) denotes that part 
of the general factor attributable to musical ability as such; G(i), that part attributable to 
intelligence ; B, the bipolar factor ; Sp, the specific factors ; and E, the factors of error or 
: nplete picture of the battery can be built up, enabling 


unreliability. From these figures a com ‹ r nab 
us to BRA in precise factorial terms both the claims made on its behalf and the criticisms 


urged against its shortcomings. | Гей 
(b) Multiple Correlations with External Criteria. The preceding results were obtaine 
when each oF the six tests of the battery was allowed the same Weight. The conclusions 
drawn, however, show that the value of the several tests as indications of musical ability 
varies widely from one test to another. The Consonance test, for example, is apparently 


137 


Seashore's Measures of Musical Talent 


devoid both of validity and of reliability, and therefore should probably receive no weight 
whatever. Indeed, in his Revised Measures of Musical Talent (which unfortunately were not 
obtainable when the above experiments were planned) Seashore himself has dropped it. 
What we now want to know, before using the test for practical purposes, is (1) how would 
the correlations be altered if differential weights were employed in combining the Scores. 
and (2) what would be the highest correlation obtainable, if we used the best possible weights ? 


As regards weights, the requisite regression coefficients, calculated in the usual way, are 
shown in Table IX below. 


TABLE IX. REGRESSION COEFFICIENTS FOR THE SEASHORE TESTS 


Test Pitch Intensity Consonance Memory Time Rhythm 
Oregon (a) .. 17 02 21 42 14 13 
Oregon (b) .. 12 21 21 44 =402 25 
Wing ge 39 —-08 18 39 04 12 
Questionnaire *03 -08 18 :20 21 —:07 


nterest ا‎ 20 "33 23:02 33 
Intelligence .. -08 19 —-09 
Age .. zT 04 —-08 —'08 ‘01 


The multiple correlations are given in Table X, and reveal with greater clarity the 
relations between the Seashore battery as a whol 


ashore t e and the other tests employed. The 
Squares of these correlations indicate the proportion of the variance contributed by the item 
in question. 


TABLE X. MULTIPLE CORRELATIONS AND VARIANCES 
—————————————ÀÀ 


Test Correlation Variance 
Oregon I Ex “75 55 
Oregonll  .. -82 67 
Wing .. - “72. -62 
Questionnaire. . -48 23 
Interest e 45 21 
Intelligence .. :32 10 
Age .. - —03 -00 


IV. SUMMARY AND CONCLUSIONS 

1. Seashore's tests of musical ability were applied to 100 undergraduate and 
postgraduate students; and an attempt was made to assess the validity of each 
component test both by intern ia (i ini : 
with the factors entering into all) 
tion of each test with independen 
more especially supplementary tests of musical and 

2. The inter-correlations were factorized by Bu 
and, since reliability coefficients can readily be cal 
results eventually obtained may be conveniently ex: 
type of hypothesis.’ They reveal (i) 
the variance, (ii) a bipolar factor ac 
accounting for 38 per cent., and (iv) e 


rs method of simple summation ; 
culated for the separate tests, the 
pressed in terms of his * four-factor 
а general factor accounting for 29 per cent. of 
counting for 10 per cent., (iii) specific factors 
rror factors accounting for 23 per cent. 
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3. The interpretation of the several factors was based partly on the introspective 
reports secured from the subjects on the mental processes used in performing each 
of the Seashore tests, but mainly on the external evidence obtained from the supple- 
mentary tests. The following conclusions were drawn :— 

(i) The saturations for the general factor were first systematically compared with 
the correlations obtained between each test in the Seashore battery and the tests or 
assessments of both musical and non-musical characteristics. It was found (a) that 
age and (b) musical knowledge have little or no influence on the general factor ; 
(c) that intelligence (especially when assessed by speed of performance) has a small 
amount of influence ; and (d) that performances in the Wing and Oregon tests show 
the closest agreement. It may therefore be inferred that the general factor is closely 
related to musical ability and appreciation. 

(ii) The bipolar factor is evidently a classification factor. It appears to sub- 
divide the six Seashore tests according to their concrete content, contrasting (а) those 
that depend chiefly upon immediate discrimination with (5) those that depend chiefly 
upon immediate memory. 

(iii) The specific factors for Pitch, Intensity, Time, and Rhythm are decidedly 
large ; those for Memory and Consonance much smaller. It is suggested that the 
saturations for these factors may be taken as indicating in some degree the ° atomistic 
nature of the battery, i.e., the extent to which it depends upon a number of specific 
and mutually independent abilities. 

(iv) The unreliability of the battery as a whole appears to be rather high. But 
this is due mainly to the extreme unreliability of the test of Consonance. ^ 

4. On these and other grounds it is argued that the common criticisms of the 
Seashore battery, as expressed by Wing and others—namely, that it is invalid, 
irrelevant, excessively atomistic, and musically meaningless—are not altogether borne 
out by the results obtained. A comparison of the Seashore tests with those of Wing 
suggests that the two batteries measure much the same general musical factor, but at 
different levels and from rather differen&aspects. Тһе lower validity of the Seashore 
battery seems due largely to its greater specificity, and might be easily improved by 
omitting the test of Consonance and modifying the speed at which the test-items are 
presented. 

5. ]t is concluded that, in its general nature, the Seashore battery is adequate 
for its original purpose, namely, to measure the more elementary abilities required for 
the understanding and appreciation of music ; but that its use will be most effective 
if the scores are weighted in accordance with the calculated regression coefficients, 
and if it is used in conjunction with other tests of musical appreciation. 
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NIGHT VISUAL CAPACITY AND ITS RELATION TO 
SURVIVAL IN OPERATIONAL FLYING 


By D. D. REID 


London School of Hygiene and Tropical Medicine 
I. Problem and Method. П. Results. III. Summary and Conclusions 


I. PROBLEM AND METHOD 


In his Moynihan lectures, Livingston (1944) has described the development and 
use, in the Royal Air Force during the second world war, of his Rotating Hexagon 
Test for night visual capacity. Its introduction as a selective method was justified 
largely on a priori grounds, although early results suggested that success in the test 
was predictive of success in action. Thus the scores for night visual capacity obtained 
by a group of successful night-fighter pilots were appreciably above the average of 
flying personnel in general (Livingston, 1948). 

The reliability of the test, and its relevance to the performance of tasks of identification in 
conditions of low illumination, has been fairly thoroughly studied (Bradford Hill and G. O. Williams, 
1943). But without clearer indications of the operational importance of testing for night vision, some 
degree of opposition to the introduction of the Hexagon Test as a method of selecting or excluding 
men hoping to go forward for flying training was almost inevitable. Moreover, with the increased 
use of radar aids in aerial gunnery and in landing, ability to see at night became less important during 
the later stages of the war. Nevertheless, it continues to be relevant both to flying and to other 
Occupations in peace time ; and this would seem to justify the publication of the following results 
obtained during an attempt at securing an operational validation of the Hexagon Test. 

The results of the Hexagon Test of night visual capacity, which was usually applied at 
an early stage in training, were recorded on each man's medical documents ; these accom- 
panied him throughout his service and were available to the squadron medical officer. In 
the course of an investigation already reported (Reid, 1947), the records of all the operational 
crews on active service in Bomber Command of the Royal Air Force were examined and the 
N.V.C. (Night Visual Capacity) scores were noted, together with the number of bombing 
Sorties completed by the individual by the date of the survey. Then, by producing frequency 
distributions of N.V.C. scores for men who had completed 0, 1, 2, 3, or more sorties up to 
the usual tour limit of 30 sorties, it was easy to calculate the mean or average N.V.C. score 
among groups of men at different stages of their operational tour. If night visual capacity 
was important in determining the length of survival, i.e., if a selective elimination of those 
with poor night visual capacity took place by their being shot down early in their tour, then 
this should be revealed by a progressive rise in the average values of the N.V.C. scores among 
those who survive to complete most or all of the tour expected of them. If, on the other 
hand, ability to see at night was quite unimportant in determining survival, the level of the 
mean N.V.C. scores in groups of increasing lengths of operational experience should remain, 
within sampling limits, unchanged. ў 

It is clear that night vision is unlikely to be equally important to the wireless operator, whose duty 
never allows him to use that faculty at all, as it is to the gunners, upon whose speedy appreciation of 
an impending attack by an enemy night-fighter the safety of the aircraft may well depend. Separate 


tabulati а for each type of crew duty. 

Cr Pues of DO skill, mechanical efliciency and all the hazards of weather 
and war which are involved in a bomber crew's chance of survival, it would be idle to expect any 
very close relation between an individual's night vision test score and the number of sorties he sur- 
Vived. Consistent rather than spectacular trends must be looked for ; and, even where these trends 
are clear enough, the comparison of such trends in the various types of crew duty demands the use 
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of some statistical method of describing them, and of testing whether apparent differences are larger 
than could reasonably be due to chance. For these reasons, regression lines have been fitted to the 
data for each type of crew duty to describe the trend, if any, of the mean night visual capacity score 
in groups of men of differing operational experience. The significance, in the statistical sense, of 
these trends and of the differences between them can then be readily assessed by the appropriate 
analysis of variance technique. 


II. RESULTS 


The N.V.C. scores were grouped according to the number of bomber operational sorties 
carried out by the individual up to the time of the survey (0, 1-4, 5-8, etc.) to give series of 
roughly equal size. The mean N.V.C. scores of the men in each of these ‘ experience- 
groups,’ divided according to the type of aircrew duty performed, are set out in Table I. 

In Table I the totals certainly show no very dramatic rise in mean N.V.C. score 
in groups of successively longer operational experience. In navigators and wireless operators 
no clear trend is apparent, but for mid-upper gunners, on the other hand, there is a consistent 
rise in mean score with each increase in the number of sorties carried out by the date of the 
survey. To study and compare these differences more precisely, the correlation coefficient 
between N.V.C. score and operational experience measured in individual sorties was calcu- 
lated and a straight line fitted to the data to represent the change in mean N.V.C. score 
with each increase in the number of sorties survived. The regression coefficients (N.V.C. 
on number of sorties) whose sizes indicate the slopes of this line for each crew duty are set 
out in Table II. 

The Total row in Table II shows that there isa very small, but quite definitely significant, positive 
relationship between the N.V.C. score at initial examination and the number of sorties survived ир: 
to the time of the survey. Further, the apparent progressive rise in mean N.V.C. score with in- 
creasing operational experience, such as could have occurred by chance less than once in a thousand 
times, is best represented by a line whose slope is such that the mean score rises by ‘0516 units between. 
groups of men with 1, 2, 3, etc. sorties, to their credit. 


TABLE П. RELATION BETWEEN N.V.C. SCORE AND OPERATIONAL 
EXPERIENCE 


Correlation Regression Critical 
Duty Number Coefficient сон Ratios 
n t 

Bits, te a2 m 622 | 0825 0646 | 20604* 

Navigator .. - ae 622 +0365 +0267 0-9106 
Bomb-Aimer š sot 681 -1350 -0939 3:5502** 

Wireless Operator... 854 0358 0268 1-0459 
Flight Engineer — 352 1125 0927 2-1185* 
Mid-upper Gunner: hoo 939 0878 0593 2.6992** 

Rear Gunner Vs v 986 "0260 ۰0172 0:8163 
Total — .. 5,056 0715 0516 5:0974** 

* Significant: P « 05 ** Highly significant : P < ‘01 


is trend can be adequately described by a straight line since the residual 
As Table TII shows, this ES E from the ‘error’ (in this case the within-array) 


Variability left after fitting it differs insignificantly 1 < C c 1 
variances The same holde for the trends of N.V.C. score in successive groups of increasing operational 
experience in the specific aircrew duties where the trend is significantly upwards. The regression 

lines for the four main groups, based on the figures in Table I, are drawn in Fig. 1. : Е 
As the tests of significance in Table II suggest, the trends of these straight lines are upward for 
pilots, bomb-aimers, flight engineers and mid-upper gunners. For wireless operators and MA 
there is no such consistent trend : in other words, there is no evidence of a possible rues imina- 
tion in action of wireless operators or navigators with below average N.V.C. scores. This, of course, 
duties is night vision important. 


is what might have been expected, since in reither of these aircrew 
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TABLE Ш, ANALYSIS OF VARIANCE 


GAM—M—————— —— ———— ج‎ cO с —À 


Source Sum of Squares D.F. Mean Square F 
PrLors 7 s 
Residual 872-3152 29 дов өз 
Regression .. 175-7040 1 34.9340 
Between arrays 1,048-0192 30 d 
Within arrays 24,194-9392 591 41:9542 
Total .. 25,842-9584 621 — = 
NAVIGATORS 2 
Residual 773-3008 29 26:6655 < 1 
i 29-9280 1 29-9280 < 
Regression .. 
Between arrays 803-2288 30 26:7743 
Within arrays 22,606-4688 591 36:5592 
Total .. 23,409-6976 621 = ые 
BOMB-AIMERS 
Residual 1,208-8896 29 41-6858 1 Mee 
Regression .. 391-0592 1 391-0592 2 
Between arrays 1,599-9488 30 53:3316 
Within arrays 19,855-4640 650 30-5469 - 
Total .. 21,455-4128 680 — Ex 
WIRELESS OPERATORS 1 
Residual 937-2624 29 32-3194 = 9 
Regression . . 39-7536 1 39-7536 10 
Between arrays 977-0160 30 32-5672 
Within arrays 30,031-2272 823 36-4899 a 
Total .. 31,008-2432 853 — pan کا‎ 
FLIGHT ENGINEERS 1 
Residual 1,215-4224 29 41-9111 mu 
Regression .. 205-7056 1 205-7056 4 
Between arrays 1,421-1280 30 47-3709 
Within arrays 14,830-3264 321 46:2001 
Total .. 16,251-4544 351 — page 
MID-UPPER GUNNERS 
Residual 1,010-5504 29 34-8466 Tos 
Regression .. 212-5984 1 212-5984 T2 
Between arrays 1,223-1488 30 40-7716 
Within arrays 26,344:5280 908 30-3609 -— 
Total .. 27,567-6768 938 "= 3 
REAR GUNNERS 
Residual 909-5888 29 31-3651 1-01 
Regression .. 19-8752 1 19-8752 si 
Between arrays 929-4640 30 30-9821 
Within arrays 28,434-4144 955 29-7743 pe 
Total... 29,363-8784 985 = = 
di dual 929-6608 1 
Residual : 29 32-0573 ee 
Regression .. 895-0416 1 895-0416 26:00** 
Between arrays 1,824-7024 30 60-8234 
Within arrays 173,136-5120 5,025 34-4550 m 
Total .. 174,961-2144 5,055 = 
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Conversely, for mid-upper gunners, the implied importance of night visual capacity agrees with 
reasonable expectation since their part in ‘spotting’ enemy fighters might well be decisive for the 
aircraft's defence by gunnery or evasive manœuvre. Similarly, as Bradford Hill and Williams have 
shown (1943a), the pilot may require good night vision for landings at night under operational 
conditions. It is conceivable, too, that the bomb-aimer and flight engineer may from their alternative 
Positions in the front gun cockpit and astradome play a part in the ‘ spotting ’ of enemy aircraft which 
would explain the relevance of night vision to their survival; but the negative findings for rear 
gunners makes such an explanation less convincing. 


As Table II shows, there is no significant upward trend in N.V.C. for the rear gunner category. 
For this there seem to be two possible explanations. First, it may be that night vision is unimportant 
in the defensive role played by the rear gunner; but the positive inferences noted for mid-upper 
gunners render this unlikely. Alternatively, there remains the possibility that the initial selection of 
men with superior night vision for employment as rear gunners (implied by their higher mean N.V.C. 
score before commencing the tour) did, in fact, eliminate men whose defective night vision might have 
proved a danger to the aircraft in which they flew. 


TABLE IV. ANALYSIS OF VARIANCE 


Source ` Sum of Squares DE. Mean Square F 
Deviations from own ‘duty’ 
regression .. Те b 172,824-6992 5,042 34-2770 — 
Between ‘ duty’ regressions .. 179-5808 6 29-9301 — 
Deviations from common re- 
gression .. M 0 173,004-2800 5,048 34-2718 — 
Common regression .. a^ 895-0416 1 7 895-0416 2612** 
Within duties .. zs T 173,899-3216 5,049 34-4423 — 
Between duties .. Es is 1,061:8928 6 176:9823 5.14** 
Total .. ers ay 174,961-2144 5,055 — ER 


ae s a n elec 
** Highly significant: P < ‘01 


The complete analysis of variance for the data isset out in Table IV. From this table 
it appears that the differences between the mean score in each crew category are significantly 
large—a reflection of the policy already noted of selecting those with better results on the 
Hexagon Test for aircrew duties which, on a priori grounds, appeared to demand superior 
night vision. Then, looking within the various types of duty, part of the differences between 
the individual scores recorded is clearly accounted for by the regression, common to all 
duties, between night vision and the number of sorties completed. The variance ratio test 
Shows that the mean square due to this regression is significantly greater than the estimate 
of random or * error’ variation given by the deviations of individual observations from this 


common regression line. 

The regression line describing the upward trend of mean N.V.C. score with increasing 
experience may, of course, have a different slope among groups of men with differing crew 
duties. Indeed, the slope of such trends might be taken to indicate the relative importance 
of night visual capacity in determining the chance of survival in different duties. A steep 
slope (such as that shown in Fig. 1 for bomb aimers and flight engineers) would imply that 
the upward trends in mean N.V.C. score are due to a high rate of elimination of men with 
inadequate night visual capacity. | TA E 

It ing to speculate on the possible explanations for the suggestive c isparities 
between. Rede poen mid-upper gunners and flight engineers. That such speculation would 
be premature, however, is evident from the top section of Table үне e E 
slopes i М ie within the limits of * error’ variation. n the whole, this ov alysi 
amplifies ао the deductions already drawn. It seems evident that in certain types of duty, 
i imers, flight engineers, and mid-upper gunners, night visual capacity 


ге 08 i b-ai and р расї 
яз Mese epu im Test was significantly, if slightly, related to the time of survival in 
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operational flying. The general course of the upward trend in N.V.C. score seemed to hold for all 
four types of duty since the differences between them were insignificant, and there was no evi к 

that this trend was other than linear. On the other hand, although the absence of trend amo g 
‘inside’ duties (navigators and wireless operators) agreed with reasonable expectation, it was a Ls 
absent for rear gunners for whom night vision had been considered to be of prime impor En S 
However, as suggested above, this last feature of the data seems in all probability to be the resu «| 
Selection. There remains the question—is the general correlation between Hexagon score and 
duration of survival really the expression of a direct causative relationship between night visua 
capacity and operational skill ? 


For the phenomena observed, several explanations may be advanced. It might be that the 
upward trend in N.V.C. score with increasing operational experience was the result of a relaxation 
of standards laid down at initial selection. Such a relaxation would presumably result in the more 
recent recruits to a squadron's strength having a lower average N.V.C. score than their more ex- 
perienced colleagues. Yet not only is there no record of any such change in policy having taken 
place, but also such a change would have produced a fairly sudden change in the level of the NV. e 
Score rather than the consistent linear trend actually observed. Again, a change such as the comple е 
neglect of N.V.C. testing in selection in general policy would have been expected to have some effec 


on the trend in a/l types of duty. In fact, as already noted, the trend was conspicuously absent among 
the ‘ inside ’ members of the bomber crews. 


22 — PILOTS 


----— MID UPPER GUNNERS 


2l BOMB 'AIMERS 


ہے 
FLIGHT ENGINEERS‏ — — 


20 


MEAN NIGHT VISION TEST SCORES 


0 5 10 15 20 25 
OPERATIONAL EXPERIENCE IN SORTIES 


Fic. 1.—Relation between night vision and operational experience. 


. _ Yet another explanation would be that the Correlation between Hexagon Test score and operas 
tional survival depends not on any direct cause and effect link but on SUE common relation to а thir 
factor, presumably some personal quality or ability which is really effective in determining © 
individual's chance of completing his operational tour. Intelligence and neurotic predisposition т 
be two such characteristics. If, for example, intelligence, as measured by the usual paper and ре 
tests is positively related both to the results of the Hexagon Test and to ts number of sorties likely 10 
be survived, then any selective elimination in action of the less intelligent would result in a simu 
taneous reduction in the number of men with below average Hexagon Test scores. Similarly, there 


146 


D. D. REID 


is some evidence that neurotic predisposition as assessed by psychiatric interview is related to opera- 
tional or flying efficiency (Reid, 1942, Davis 1948). There is also evidence which suggests that 
patients suffering from neurotic illness do rather poorly at the Hexagon Test (Livingston, 1945). 
It does not follow from this, of course, that these men would have done poorly in the test before the 
onset of their illness, c.g., at initial examination before exposure to operational stress—their per- 
formance may be a sign of their acute illness rather than a permanent feature of their native abilities. 
The possibility that there is some such underlying explanation for the results observed must, however, 
be investigated. 

. Some light on the problems involved may be obtained by examining the results of 
intelligence tests, night vision capacity tests, and psychiatric interviews conducted in the 
course of another investigation on a large series of pilots under training. Considering the 
nature of the Hexagon Test, it would not be surprising to find that some of the factors in- 
volved in intelligence tests of a conventional kind might have an influence on the results 
achieved. As it happened, the intelligence tests administered in this particular experiment 
were divided into a test of form perception (V.C.), a test of verbal intelligence (I.M.A.) and 
a test of mathematical reasoning (C.M.). As Table V shows, none of these sub-tests shows 
any significant relation to the results attained by the same individuals in the Hexagon Test. 
Again, results of a general test of intelligence gave similar negative findings on correlation 
with Hexagon Test scores (Parry, personal communication). It follows, then, that the 
correlation of Hexagon score with operational survival does not depend upon any coincident 
correlation with intelligence in so far as intelligence is measured by the tests used. 


TABLE V. CORRELATION OF HEXAGON TEST AND INTELLIGENCE TESTS RESULTS 
_ —————— 


Test | r n a t 
XC. s | 0735 367 1-4077 
LM.A. | -0009 367 0000 
см. . | “0419 365 | 8091 


Note : all coefficients are insignificantly different from zero. 


In the other investigation mentioned (Symonds and Williams, 1948) neurotic predis- 
position was assessed by two psychiatrists who, on the basis of a biographical interview, 
Classified the men examined into four categories of predisposition to the risk of breakdown— 
nil, slight, moderate, and severe. The N.V.C. scores given by the Hexagon Test for the 
individuals thus classified were then divided up according to the usual conventions— 
Excellent (30-32), above average (20-29), average (9-19), below average (3-8) and poor (0-2). 
Contingency tables, summarizing the relations existing between broad groupings of these 
categories both of psychiatric assessment and Hexagon Test results, are given separately 


for each psychiatrist in Table VI. 
For both psychiatrists, although the trend in results is suggestive, there is no significant 
relationship between assessment and N.V.C. score : (for Psychiatrist A, y^ = 2:3877, n = 2, 
50>P>-30; for B, X = 14791, n = 2, 10>P>°50 ; together, a 3:5668, n = 4, 
‘50>P>-30). In so far as these results give a fair picture of this relation, then there is no 
reason to suppose that the observed trend in mean N.V.C. score with increasing operational 
experience is really dependent on the part, if any, played by neurotic predisposition as a factor 
in survival. The very slight relation appearing in the table may be enough to explain the 
insignificant rise in N.V.C. score which takes place in all duties with increasing experience. 


i igni i Vac. 
To sum up, the evidence suggests that the significant upward trend in mean N 
Score found inall but one of the “outside ' types of bomber-crew duty is an expression of 
Some causal relation between the abilities detected by the Hexagon Test and the number of 
Sorties survived, i.e., that night visual capacity, irrespective of any association with either 
intelligence or neurotic predisposition, is of definite importance in operational flying. 

As the low correlation coefficients between N.V 


.C. score and length of survival suggest, the 
importance of night vision, relative to all the other factors involved in surviving the many-sided 
hazards of operational flying, is com 


paratively small. Indeed, less than 1 per cent. of the total variance 
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i i in ni i i e the 
i th of survival is accounted for by differences in night visual capacity as measured by th 
mens Test. But this is hardly surprising. Quite apart from the many factors involve 
operational survival, there is a close interdependence between all members of a crew ; en E. 
combined with the fact that the Hexagon Test was used to eliminate those with more severe ч OR cs 
in night vision, must tend appreciably to dilute the correlation between test result and lengi ae 
survival. It is indeed a little unexpected that any such correlation could be demonstrated in opera: 
data of this kind. 


TABLE VI. RELATION BETWEEN PSYCHIATRIC ASSESSMENT AND N.V.C. SCORE 


— 


N.V.C. Score 
Psychiatrist A's Above Average Below Average Total 
Assessment )8-32( (0-7) 
ОШ p e 34 (48-57%) 36 (51-43%) 70 
Slight Кы м” Da 32 (42-10%) 44 (57-90%, 16 
Mod. and severe 7 (30-43%) 16 (69-57%) 23 
73 96 169 
Psychiatrist B's Above Average Below Average 
Assessment (8-32) (0-7) Total 
ie Gee с. 32 (39-02% 50 (60-98%) 82 
Slight .. oe uc Же 23 (33-82%) 45 (66-18%,) 68 
Mod. and severe Ug 6'(27:27%) 16 (72-73%) 22 
61 111 172 
n ees | eee 


, The operational validation of tests of all Sorts in the Services or in industry is expensive in s 
of time, labour, and money. Theoretically. validation requires the initial testing of all subjects W d 
Subsequent career in the chosen pursuits is then followed up. But in circumstances where Dew 
tests are being introduced, there is Probably no entirely satisfactory substitute for this approac 5 
Where the need is for a rapid assessment of some personal fact (e.g., a physical measurement О! 


biographical detail), some variant of the method used in this study may be useful, particularly 1 
only current personal records are available. 


III. SUMMARY AND CONCLUSIONS 

1. In the hope of assessin the im 

by the Livingston H 1 
statistical analysis was 


nce. Significant upward trends suggestive O 
and the length of operational surviva 
embers of a crew (pilots, bomb-aimers, 
who had occasional look-out duties, bU 
members of the crew, viz., navigators ап 
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3. The association noted in these cases does not appear to be the result of any 
correlation between night visual capacity and intelligence or between night visual 
capacity and intelligence and neurotic predisposition. There was no demonstrable 
relation between the scores obtained with the Hexagon and the results of testing of 
intelligence or neurotic predisposition. The unexpected result in rear gunners may 
be explained by the rigorous selection standards on the Hexagon Test demanded for 


rear gunners. 
4. The general conclusion is that, whatever its precise mechanism, night visual 
capacity as measured by the Hexagon Test had a definite, if small, influence on 


survival in operational flying. 
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PRIMARY FACTORS OF PERSONALITY 


By H. A. REYBURN and M. J. RAATH 
The University of Cape Town 


I. The Experiment. ЇЇ. The Centroid Analysis. III. The Orthogonal Analysis: 
IV. The Oblique Analysis: (a) Orthogonal Projection; (b) Parallel Projection. 
V. The Factors. VI. The Interrelations of the Factors. УП. Summary. 


I. THE EXPERIMENT 


The purpose of the experiment discussed in this paper was to reconsider and 
extend the results of factorial analyses of personality qualities already made by 
(9, 10, 13). A larger range of qualities has been dealt with, the choice being governe 


Er oy inl experience and partly by analyses published by others, particularly 
Cattell (5). 


There appear to be three main methods of dealing with the axes which a centroid 
analysis yields. 


First, the axes, as they stand, may be taken to represent intelligible factors. The resulting 
general factor and subordinate bipolar factors are then regarded as concordant with the deman 
logical definition (1). Against this must be set an important and perhaps decisive objection. 


factors thus reached are relative to the battery of tests or variables, and thus lack objectivity (11, 
p. 397, and 12, p. 55). 


" " we 
Secondly, the axes may be rotated to find simple structure. For reasons given elsewhere v 
are not satisfied with this s 


т r 3 ji t 
1 s solution (12, 14) ; and in practice it does not seem to give the objecti 
claimed. Indeed, the minimum conditions of simple structure are not always rigidly insiste 3 
ang ne for the supposed unique result turn out, in fact, to be a matter of degree (16, PP- 

an 5 


i ucl ight‏ = ر 
Thirdly, factors may be identified positivel and axes drawn through guiding points in the lig‏ 
of all available knowledge (12, pp. Sot"; cf. is, рр. 287f.). Тһе ipu is then the intelligibility‏ 
of the analysis as a whole and its concordance with other results. Certainty and objectivity EM ББ‏ 
to be expected at first. Factors which will remain objective in all relevant circumstances mus' ay‏ 
sought gradually and by a method of approximation. It is our hope that this further paper mà‏ 


contribute to this process. 

For the present enquiry two groups of students (37 men and 46 women) acted as 
observers ; all had some acquaintance with psychology, and most were advance 
students. Each was asked to select two subjects, approximately of his own age, ап 


thoroughly well known to the Observer. The total number amounted to 160, VIZ., 
men and 82 women. About half were p 


were fairly evenly balanced. In a few c 


d candour. A questionnaire was then filled 


In Cattell’s analysis bipolar qualities were fre uently used : і ently assumed that 
psychologically, if not logically, the opposites were conto der s pir d pri gy We were 
unwilling to make this assumption. Accordingly, the negative was taken to be the absence or extreme 
infrequency of the quality. It was left to the analysis itself to show if contraries could be treated aS 
contradictories. 
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In administering the questionnaire the two groups of observers were taken separately. Each 
Observer was given a typed copy of the questions and one of us went over the items with the whole 
group, giving further explanations. Each item was answered by everyone before the next question 
was considered ; and stress was laid on the need of keeping the answers independent. There was 
one difference between the two groups. ltem 26 deals with the level of ideal which the subject 
demands. The first group answered the question in a general way ; the second divided it into two, 
reporting separately on the standard which the subject exacts from himself and on that which he 


exacts from others. 
, The ratings for each pair of qualities were correlated, first for each of the two groups of 80 
subjects, and then for the full 160. The results served as a check on the reliability. In addition the 


answers were scrutinized in the light of the sketches already made. 


II. THE CENTROID ANALYSIS 


In making an analysis by simple summation or the so-called centroid method 


there is no infallible criterion to determine the number of factors to be extracted, and 
in any given experiment more may be present than the accuracy of the data allows 
one to extract. Weight must be given to the probability at each stage that the residuals 
can be explained as chance variations. This criterion was adopted by Reyburn and 
Taylor in analysing some of Webb's data (13, p. 157) ; but theoretical and practical 
considerations have since induced us to think that the analysis should be carried 
slightly beyond the stage thus indicated. In general it is desirable to bring the average 
residual below the probable error of a zero correlation ; and attention should be 
paid to the rate at which the average contribution to the variance diminishes. 


In the present experiment, after five factors had been extracted, the average resi- 
dual was ‘051 ; the P.E. of a zero correlation for 160 cases would be + :053. One 
more factor may perhaps be extracted. When this is done, the average residual falls 
to ‘046. The average amount of variance extracted by each successive factor is (in 
percentages) as follows: Factor I, 14-8; П, 10:5; Ш, 10:9; IV, $54; V,32; 
VI, 2:2, The saturations obtained for the several traits are shown in Table I. These 


may be called Fo. 

The test of a successful rotation is the intelligibility of the factors obtained. Each should be 
a unitary function, and not a mere aggregate of accidentally associated qualities. It is difficult to 
rotate successfully if nothing is known beforehand of any of the factors. If, however, it is known 
that certain factors are present, the task is simplified, and it may be possible to establish new factors. 
In the method we adopt it is thus necessary to begin with some hypothesis regarding at least a number 
of factors, and the first and immediate test of its adequacy is the intelligibility of the set of factors as 
a whole. The nth factor must be as intelligible a unity as the first. : 

No single experiment, however, can finally establish the objectivity of any factors. Objectivity 
becomes finally manifest only when we can assume a set of factors in a variety of contexts and find 
On each occasion that the remaining factors also prove significant. The ultimate test is the coherence 
and intelligibility of the factors through the widest range of experimental material. In searching for 
factors which are objective in this sense, one must not expect to find them with high precision at the 
Outset. If some variable is known to have a heavy load on a factor, it can be used as a guide to 
determine the best position of the axis. But the guide will not be perfect. The final position for 
the axes can only be approached gradually and in a series of approximations. In making an analysis, 
therefore, it is important to use past experience as well as present data. | 

The first decision to be made is whether the rotated axes are to be orthogonal or oblique. There 


are obvious advantages in favour of the former: the system is simpler, and the independence of 
the factors is a АП able asset. With the latter the angles between the axes are a part of the total 
picture ; and whether they are resolved into factors of the second order or not, they add to the 
complexity of the analysis, and thu: 


s seem to offend against the principle of parsimony. 

d difficult to identify. Often they seem 

to other factors has pen te Pod onn 

ical difficulty, it often presents psychological difficulties. urther, 
i is of the greatest importance. The possible load: of the 

p opem Б i tofthem. Thus the art factor 

i i d to have what 

tends to ab aximum loads of the variables, and the last one is only allowed to he 

is left о аро the correlations with the other factors have been deducted. IB objections may 

not be fatal, but they are formidable. And they induced us to make a double analysis, using ot 

orthogonal and oblique axes. 
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III. THE ORTHOGONAL ANALYSIS 


We begin by fixing provisionally points through which the axes may be passed. 
To determine these points we made a survey of the data. The correlations were 
classified according to the significance which could be attached to them. 


TABLE I. CENTROID FACTORS 
= eee 


No. Trait I II ш IV M VI т 
1. Avoids Problems ..  .. 3390 278 —:575 —:200 —-038 -189 | -638 
21 Overbold is: — i; 2 336 —252  .3233 104 —061 -069 | -300 
3. Unsystematic By дз 373 — 084 —:380 —:289 230 128 | -443 
4. Spontaneous.. —..  .. "445 —-529 —.067 -111 -082 101 | :512 
5. Emotional ..  .. ¢. 33, —211 —450 -441 —113 —:182 | :597 
6. Emotionally variable eM 328 —-240 —-093 -027 ۰110 —-092 -195 
7. Emotionally uncontrolled |. "632 —-039 —-071 261 —-122 ۰005 :490 
8. Relaxeseasily — .. — .. 270 —.148 —:224 —-428 -228 008 | -380 
9. Looks on bright side — .. 211 —-518 —030 000 —-065 —-093 | -326 
10. Avoids strong emotions .. | —-503 33 086 —090 —.091 —-118 | -395 
П. Easily distressed .. — .. 142—048 —263 -457 —+188 —-065 | 341 
12. Prone to sad thoughts — .. 064 — 415 —-259 397 057 —-058 | -408 
13. Fits of depression .. — .. 306  .337 —295 303 -158 —273 | -434 
Ih Аус ена C ie I. | 37 302 387 —198 اس‎ 000 445 
15. Feels inferior E - — :050 420 —-456 243 —.032 "391 601 
JOY ы ec езана сй б copes 0| 467 
ase sw. „ еј 322 118 630. ооо 134 | 509 
18. Submissive»... — .. | —309 —144 —570 —125 —060 15s | 485 
19. Subjective outlook .. — .. | 329 185 362 198 000 — 114 | 5336 
20. Fixed in outlook .. .. | —395 — 41457 209 120 2000 X | 337 
21. Suggestible .. |) l 138 —071 —-560 —-149 —453 -201 | 424 
22. Contra-suggestible = s :252 :326 380 115 175 060 :362 
24 compliant. ..  .. | 30 338 мо 020 23 —293 | 5538 
24. Dominating.. | 480 119 472 15 022 261 5551 
Zee diente ee... .. | BIB —180 —22 158 007  12| 29 
2 pue snb o | 5 63 155 —255 ч * 2| 19 
26b. Realistic : Others .. э» *561 208 —:179  —.367 370  —:178 693 
27. Aggressive .. 1 527 311 335 -194 072 -021 | -530 
О ШШЕ а o. | «i339. i553 о 194 :000 —:072 | :439 
29. Self-esteeming |. ^ U 255 —-182 .598 136 055  —-175 | -508 
30. Conceited .. jj Y 310 — 204 300 564 
mda | 874 213; ШЫ 501 
32. Reliable — .. n eo | —556 —-040 -264 :490 
RS ы ш. | = 400 сә 130 563 
34. Cheerful ~ .. e: кы 360 —:700 115 E 654 
25. EE E сы 455 349 036 271 —038 —-156 | 429 
Au kained ° 7563. —5 =1999 ою 020 031 | «538 
oe SO | NS “Л age до йо му | 9 
eee uw D m 214 — 48 — 235 184 089 228 | 396 
40. Sensitive to others . i | 7329 —:469 ~-202 -106 —-101 —-104 | -402 
т о eee | Ru Es I. 
4. Touchy Su os vos 122 -310 258  .544 —.147 -—.074 | 0 
de Bice .. ae ae 001 —.616 029 —-268 ~—-091 —-003 460 
. Blames others id 34 -585 313 .05 3 Ê —.038 *501 
44. Unbalanced .. te vs 426 -282 350 133 E 168 Cah 465 
45. Mixes badly ge s | 99372 — 585 iee —(029 „ 058 —133 | +530 


In the first class were placed all which, i igni t 
or below the -001 level ; in the second Sls tho: e atr ОҒ 160, could be "nie e or 
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:20 level; and all the rest were regarded as devoid of significance. In the analysis attention was 
given almost exclusively to the first two classes. 


. The First Factor. One of the factors isolated from Webb's study on character and in- 
telligence was named ‘ cleverness ' by Garnett (7) and * surgency ' by Cattell (3 and 4). 


In 1933 and 1936 Cattell indicated the close interrelationship between surgency and a number 
of other qualities. The relation was set forth as a bipolar one. This is shown in the following 
list which indicates the more intimate connections. Positive Traits: cheerful; natural; sociable ; 
humorous ; adaptable; gregarious; quick; hasty. Negative Traits: gloomy; formal; un- 
sociable ; earnest; conservative ; exclusive ; slow; introspective. 

In 1939 Reyburn and Taylor found that the following qualities had significant factor loads on 
an axis placed to measure surgency-desurgency. Positive Loads : fondness for large social gatherings ; 
sense of humour ; cheerfulness ; corporate spirit ; bodily activity in pursuit of pleasure. Negative 
Loads: fits of depression ; conscientiousness. In 1941 Reyburn and Taylor dealt with the factor 
from the desurgency end, and found the following loads. Positive Loads : Indulges in self-pity ; 
loses head in excitement ; worries; is easily hurt; prefers working with others. Negative Loads : 
Takes prominent part in social affairs. In 1943 Reyburn and Taylor obtained the following results 
for surgency : Positive Loads: carefree ; happy-go-lucky ; unconcerned about others’ opinions ; 
likes excitement ; unconcerned about future ; also talkative. Negative Loads : over-conscientious ; 
worries ; easily distressed ; thinks before acting ; dislikes being interrupted. 

In 1944 Cattell made a fresh analysis of the ‘ Personality Sphere ’ (4), and obtained a classification 
of traits for the bipolar factor (re-named surgency versus agitated, melancholic desurgency), which 
will be found in his book (5). A third and slightly different arrangement was presented in 1947 (6). 

_ A consideration of these various analyses shows that there are several aspects to the 
activity which is being treated as a functional unity. On the one hand, there is an emotional 
aspect; on the other, a more directly conative one, which may be called responsiveness. 
In our experiment, variable No. 4 (spontaneous) corresponds closely to the latter aspect ; 
the former one seems adequately covered by variables №529, 12, 13, 34, and 42. Т heir 
inter-correlations merit examination ; and are shown in Table II. 


TABLE П. CORRELATIONS BETWEEN VARIABLES CLASSED 
AS RESPONSIVENESS 


No. Variable | 4 9 12 13 34 
9. Looks оп bright side .. | 423 

12. Prone to sad thoughts. . | —156 —-280 

13. Has fits of depression .. —-038 —-237 492 

34. Cheerful.. me ws; | 3989 510 —-418  —:399 


42. Buoyant.. gs os | :349 389 —:434  —:315 25: 


The variables fall into two groups : Nos. 9, 34, and 42 present the surgent aspect ; Nos. 12 and 
13, the desurgent aspect. The bond between the two sections is fairly strong, but weaker than within 
themselves. Moreover, they show a different relation to No. 4. Its correlation with the group con- 
Sisting of Nos. 9, 34, and 42 is +536; whereas with the two desurgent variables, Nos. 12 and 13, 
itis only —-123 : i.e., the two emotional aspects, though related, do not move together as a functional 
unity. 'Spontaneitv carries cheerfulness with it, but does not exclude depression. 

The: i ions suggest that it is inadvisable to attempt, at least at the beginning, to locate 
a po Б factor, and that it is better to use spontaneity as a guide to determine 
the positive aspect of the main factor in question. A trial _of two arrangements, one in which 
spontaneity alone was used as a guide, and another in which it was combined with the centroid of 
variables Nos. 9, 34, and 42, showed little difference in the results. Accordingly the SIDE E 
used, and an axis was passed through No. 4, the factor obtained being provisionally name: 


Spontaneity. : » 
nd factor represents Stability or integration. To reac! 
VERONA у d of variables Nos. 5 reflected (not emotional) ; 


it, an axi d through the centroi г : 
13 selectus thee frau ts of Чоро) ; 20 (fixed in outlook) ; 32 (reliable) and 37 reflected 
(not easily rattled). 
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j 26, 480, and 489) 
i is in line with two of Cattell’s factors, viz., C and G (5, pp. 317, 326, 480, jon 
and Шы с s s by him to them are reasonably well covered by the relevant relat 
ships of our second factor as set forth in Tables V and VII moe by the first factor, the negative 
Although the positive aspect of surgency is largely accounted ‘or by the ELA 200 а 
aspect remains almost intact. А desurgent factor, named ‘d’ (mental ا‎ :h intelligible in 
by the Guilfords in 1939 (8). But such a factor, when tried out on our а. ale hough De SAL te 
itself, does not combine well with the factors that follow Шора io parse pb. tinge are not 
difficult to interpret. The various traits in the ER from them, is the remainder fendi 
ducible to а single factor ; nor, if desurgency is al presuppose some positive principle 
Swe Moreover, neurotic deviations from the normal presup| . 
of integration. . P ird factor is 
The Third Factor. The remaining factors may be described inie Беу, i Bors РЕГ, 
derived from Webb’s conception of w (17). In 1939 Reyburn and Taylor, ОУ and perseverance than 
altered the conception chiefly by giving more weight to steadiness, continu m Factor appearing, 
to action from principle or purpose (9, p. 162). In 1943 they found t с E kind has also been 
although less clearly, in some of the Guilfords" material (13). А e iable No. 33 is taken as 
postulated by other writers who have not sett ina ере arial d 
à guide to it here, and Persistence is suggested as a suitable name. : А ia) versus 
S The Fourth Factor. One of Cattell's factors, E, is described as * dominance epee ed 
Submissiveness ' (5, p. 321) ; and a similar quality has been recognized by ош nine in ND; 
fairly well in our list by variable No. 24 (dominating). But we have one a егде Са ВБЦ 
(assertive). As сап be seen from the figures the two traits—dominating and assertive—ag 
many points. But there are nearly as man 
a simple or ultimate unity, 
decided to pass an axis thr 
of self-confidence as descri 


it was 
but involves assertiveness along with other elements. Eventually it ibat 
ough No. 17. The factor of Assertiveness thus obtained resem 
bed by several other authors (18). d Taylor 
The Fifth and Sixth Factors. In the analysis of Webb's material (9, p. 162) Reyburn an eriment 
found a factor which they named charity. 1а the list of qualities used in the present Loue if it 
there is a variable, viz., No. 39 (Kind), which might be expected to prove a guide to this "however, 
is present in this analysis. Consequently an axis was passed through No. 39. The result, 
was unsatisfactory. 

The fifth factor offers a choice, for it ca: 
are variables No. 11 (easily distressed), 
on the other, No.26 a and 5 (low sta 
atic). The factor is bipolar, with sen: 
at the one end, and toughness, callous; 
ough Presents the situation fairly we 
end. 


۴ e 
n be regarded from either end. On the one uod 
No. 36 (excitable), No. 5 (emotional), No. 41 (touc! УШ” 
ndard of behaviour), No. 8 (relaxes), and No. 3 (ИШ tion, 
sitiveness, in perhaps more than one form or mani АДЕТ and 
ness, and indifference at the other. Тһе contrast of ten ositive 
ll; and it seemed advisable to make Sensitiveness the p 

The sixth factor afford: 


" Wn Р icture. 
s a measure of a feeling of Inferiority, and gives a clear, coherent pic 


a й оГ a fe с - the factor 
The transformation matrix (7) which gives effect to these ideas is shown in Table III : the fa 
loadings resulting from this transformation may be called F,. 


TABLE III. ORTHOGONAL TRANSFORMATION MATRIX 
Acc OO O O 


623 —:515 —.283 511  —052 054 
—739  —.420 —.218 314 —044 :359 
—-094 ‘S71 +123 “785 434  —4122 
455 ~-105 -560 *008 -543 :597 
114 161 300 058 —+813 455 
141 "4l — —673  —.40 150 :538 


ааа Ш С. 


опе hand, each axis may b it were the first, no allowance being made for a 
correlation of the variables with the other factors ; on the other, each factor may be DOS 
as the last, and its loads cal fect due to the correlations of the variab e 
with the other factors has bi We may consider these two processes in turn. 3 

(а) First Method. Usin Points as before, we may pass an axis through e 
of them without reference t. Other axes, In doing this we have a choice © 


g the same guiding 
О the position of the 
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starting points, vió., the co-ordinates given bv SS кз ЧК ог fbose given by tbe 
2 orthogonal rotation. Тһе use Of the i rz bs Ne NTL SHES 0, У ЗАРУБА, 


required (Т) is shown in Table IV below. 
TABLE IV. OBLIQUE TRANSFORMATION MATRIX 


100  —44  —24 326 242 —271 

0 894 519 366 —:338 —-348 

0 0 -817 -128 342. —348 

| 0 0 0 862  —331 —396 
0 0 0 0 774 180 

0 0 0 0 0 704 


^ The transformation matrix is triangular (see Burt, 1, p. 306) ; the factors obtained are oblique : 
and the cosines of the angles between them are given by the inner products of its columns, as follows : 


TABLE V. CORRELATIONS BETWEEN FACTORS 


| 
Trait | 1 2 3 4 5 6 
| 


1. Spontaneity.. Ws 1:000 

2. Stability — .. “» —:447 1:000 

3. Persistence .. Pus —:249 :576 1:000 

4. Assertiveness = +326 “181 :213 e 1:000 

$5. Sensitivity .. yix :242 —411 044 — 7 1:000 

6. ` Inferiority .. "P 271 +190 +398 ۰602 +203 1-000 


ج ن ن S‏ 


(b) Second Method. If the axes obtained by the method just used are regarded as primary axes 
in Thurstone's sense of the term, the loadings obtained are the correlations of the variables with those 
axes; and Р» will give the * structure ' of these factors. To find the * pattern ’ for the same factors 
y the transpose of the inverse of T, (Table IV). This transformation 


we must post-multiply Table II b; 


may be termed Ta. и А 
The difference between the factor loadings obtained by the two transformations can be expressed 


as follows. With six factors there are six hyperplanes each containing the axes of five factors and the 
axes themselves can be regarded as the six lines in which these hyperplanes intersect one another. 
h axis is not orthogonal to the hyperplane which does not contain 
it, but forms an angle with the normal to that hyperplane. In Thurstone’s procedure, where guidance 
m the conception of simple structure, the hyperplanes are first determined, and from 
these the normals and primary factors are derived. The orthogonal projection of the variables gives 
and the parallel projection gives the pattern—in which the simple structure 
hree sets—the hyperplanes, the normals to the hyperplanes or reference 
d the primary factors—are related in such a way that if any one 
others are also determined. Thurstone, as has been said above, determines 
But it is possible to begin elsewhere, and we have begun with the primary 
Itiplication of F, by T; gives the factorial structure, i.e., the 
to the primary axes (the correlations between the variables 
f F, and T; gives the parallel projection of the variables 
There is, however, an advantage in making this parallel projection not on 
t on to the reference vectors. To do this the loadings given by 
lied by the cosines of the angles between the primary axes and 


Since the factors are all oblique, сас! 
is sought fro: 


| the structure of the factors, 
? is supposed to be found. The th 
vectors as Thurstone calls them, an 
set is determined, the 
the hyperplanes first. 
factors. As we have seen, the post-mu 
loads obtained by orthogonal projection on 
and the factors). The post-multiplication © 
on to the primary axes. 
to the primary axes themselves bu 
F, T, would have to be post-multip 
their corresponding reference vectors. i К 1 | 
ing factorial arrangements has its own value. If simple structure 1s 
aped: ces а of course, is essential. But on the whole, when knowledge is 
nly in the process of discovery, preference, we think, should be 
tion of the variables on to the primary axes, 1.6., to the factor 
| be briefly summarized as follows : 
axes can be accurately calculated if the 
| le, are difficult to identify and recognize 


155 


Sought, parallel projection, 
incomplete and factors are О 
given to the orthogonal projec 
loadings described as F,. Our reasons may 


jecti t the loads on the 
(1) In parallel projection, al hough s io 


Whole system is known, the factors themselves, 
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i i it were, 
in the concrete. As in the case of the later factors in an orthogonal system, each factor is, as i T 
рес left when several other things have been taken away ; and generally it is not easy ps sea 
what this residuum is: whereas with EAE orfhogonal projection = to the primary > 
sychological meaning of the factors can usually be grasped more rea MES 

ВУ (2) The accel use of parallel projection requires that the factor analysis has been completes 
and all the factors found. If, for example, the centroid analysis has extracted only ive PR S ERO 
six are really present, and we attempt to find a factor-pattern on five oblique factors, s Ура er pO: 
goes awry ; for even if good guiding points for the five axes can be obtained, the directi: 2) 
jection on to them is not known. The hyperplane, parallel to which we project, E о y Sore. 
instead of (n—1) dimensions. Of course, with direct orthogonal projection on to t s eus case 
senting the primary factors, there is also error ; but the error is usually not so great. n era 
what is missing is the guidance which would have been obtained from an additional сето ie 
With parallel projection, however, there is no guarantee that the missing factor is smaller 
rest ; and moreover, the axis of each of the others is slightly out of its true position. я extract 

(3) It may well be that we cannot place all the primary factors correctly, even if we can йы 
the right number by the centroid process. In sucha case parallel projection is unworkable ; N Heter- 
the method of orthogonal projection on to the primary axes, when these axes are positively ERA 
mined, may be used for any axes that are known. Ifan adequate centroid analysis has been е, 


" = x " " ain 
a study of a single factor can be carried out with considerable success, even if the other factors rem: 
undetermined. 


V. THE FACTORS 


ave found, together with their relations to the original dup 
gures in brackets indicating the correlation between 


The six factors which we hı 


may be set forth as follows, the fi. 
factor and the relevant variable. 


alanced, 


{ 4 576); 
е | generally emotional (627); (b) nor excitable (57 
(c) лог easily flurried (-566) ; (d) restrains expression of emotions (:558) ; (e) is usually self-reliant 
(544); (/) reliable (-544) and (g) faces up to problems (528), and to tests (488) ; (A) tegue to 
igh standard of behaviour (487), and (i) has fixed ways of looking at things (476) ; (j) is slo (my 
(k) free from fits of depression (-456) ; (I) persevering and persistent (-432), and 


intaia his goi і With а high degree of this quality the subject is quick and ready to 
maintain his point of view. He is not inclined to let things Dass Sith hick he dou not aa 627); 
i odesty (633), and (b) not only submissive (62), 
врео pre E y (d) is not shy (-565) : loe indifferent 15 the views of othe 
A S :497), ап A d. Area eee 
(461) but (i contra-suggestiblé (431). (9) 191. Felines Of inferiority (466) ; (I is not sugg 


j iti - . А oking at 
things (425), and (I) faces up to t Г) ambitious (456) ; (k) has his own way of lo 
Factor V—Sensitiveness. Wit i 


h a high degree of this quality the subject is easily touched and 


indicates 
The contrast of tender versus tough indicat 
quacy. The s 


Standard of bel 1 Д xpects it O 
others (413); (d) is distressed by the difficulties of yOu" for himself (548), and (С) ех 


3 2 N -479),and (f) is 
excitable (407) ; (g) compliant and conciliatory (ines sos KO touchy (270) 
kind (:308). 

Factor VI—Inferiority. With a high degree of this li j from distressing 
feelings of inferiority. These may be covered and even А [rad et. action ; but 
they underlie much that he does. The Person who has a sense of inferiority (a) has a low opinion © 
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himself and his abilities (-596) ; (5) lacks cheerfulness (-540) ; (c) avoids the problems of life (498) ; 
(d) is slow to assert himself (470) ; (e) is prone to sad thoughts (467) ; (f) touchy (451), and 
(g) does not easily regain cheerfulness (433) ; (Л) is shy (418) ; (i) often unbalanced and immature 
(364), and (j) is a poor mixer (-360). 


VI. THE INTERRELATIONS OF THE FACTORS 


The interrelations of the factors have been given in Table V above. But since, to reach 
this result, the original data have undergone a number of computational processes, it is 
desirable to obtain some check of the reliability and objectivity of the analysis as a whole. 
To this end use has been made of fresh material. A considerable time after the original 
experiment was carried out, it was repeated with different observers, reporting on 62 new 
subjects. Then, each factor being taken in turn, weights were provisionally allotted to the 
qualities associated with them as set forth in section V above, and scores were calculated 
for each subject. These, when inter-correlated, gave the following results : 


TABLE VI. INTERRELATIONS OF FACTORS 


Trait 1 2 3 4 5 6 
1. Spontaneity.. - s$ 1-000 
2. Stability — .. vs —'181 1:000 
3. Persistence .. 25 Pi —-087 -587 1:000 
4. Assertiveness as e “117 +182 "220, 1:000 
5. Sensitivity & e. 039 —-120 —:036 —-174 1:000 
6. Inferiority —-490 —:240  —264  —:295 +120 1-000 


The agreement of this table with Table V is probably as good as can be expected with the data 
and the method adopted. All significant figures have the same signs ; most of the numerical values 


agree fairly well, the differences reaching the 5 per cent. level of significance only in two cases. Taking 
the two tables together the following generalizations may be ventured. a a 
Spontaneity tends to be associated with assertiveness and to reduce stability and inferiority. 
Stability tends to be associated with persistence and possibly with assertiveness, and to reduce 
spontaneity, sensitiveness, and inferiority. Assertiveness tends to be associated with spontaneity, 
persistence, and possibly with stability, and to reduce inferiority and sensitiveness. Sensitiveness 
is possibly associated with inferiority, end tends to до UI and assertiveness. Inferiority 
te i rsistence, stability, and spontaneity. 
ш ыыы ы е d, it is ADE to ask whether the inter-correlations of the 


When a system of oblique axes is изе i : 
factors can безхесоцц!ед for by a secondary factor or set of such factors. But in the present instance 


at actors would be required ; and we have so far discovered no convincing 
EM pa wing Ends could be rotated. All the many positions which they might take up are 
equally arbitrary ; and, in the light of existing knowledge, it seems inadvisable to lay any weight on 
these factors. СЕ: R p 
The ultimate value of factors derived by analysis lies in the light which they throw on 
the individual. They provide a framework within which the study of him may be set, and 
Obviously the first step will be to assign him a score in respect of each factor. This, however, 
does not exhaust the situation. There is what may be called a normal relation between the 
factors themselves, discovered in the first instance statistically by reference to the total 
Population examined. But there is no ground for believing that this statistical or normal 
relation must hold in detail for every individual. The interrelations of the factors in each 
individual are also a part of the true picture of him, and those с ы a I 
" irect one. The individual may be measured a con- 
Чез pd us method would be ей the factors, and the correlations resulting Bom those 
measurements may be compared with Ше norma res PE Ed eed ach pe 
oct Bar nas called Рокс nue between them in the inahin e process Would be 
aborious ; and, although it seems the most fundamental method, it is not always p 
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ing i i i ideri indivi Its to 

ithout applying it generally, it may perhaps be possible by considering individual resu 
gain pies MEE farther E on these or other lines is desirable. When the score oran 
individua! on a factor deviates significantly from the mean, we can ask whether this seve in E. 
accompanied by any other unexpected deviation in his scores on other factors. In 1 c n 
orthogonal factors, the normal exception is the mean itself ; and hence in this case the ape 
which we have to ask is whether more factors than one deviate from the mean at the same time. E 
two or more such factors do so, this may be due to accidental variations in the separate factors ; Du 
it may also be due to an unusual connexion between the factors in the individual. In the cas aed 
oblique factors, when one deviates from the mean, the most probable position of the other is potus 
mean itself, but a distance from it given by a regression equation. If the other factors do 101 a 5 
the values which these considerations lead one to expect, a qualitative analysis of the individua E 
generally indicated. If the factors obtained by analysis are used in this way their value may 
greatly enhanced. But a fuller discussion of the point must be reserved for another occasion. 


VII. SUMMARY 


1. The purpose of the experiment discussed in this paper was to modify and 
extend the analysis of personality qualities already made by the method used by the 
authors. Eighty observers each gave a personality sketch of two subjects, rating 
them also on 45 or 46 qualities. The resulting correlations yielded six centroid 
factors. р 

2. Rotation of the centroid axes was undertaken both into orthogonal and into 
oblique positions, the latter being dealt with in two ways, viz., to give structure on 
“primary factors’ and also on ‘reference vectors.) The reasons for selecting the 
positions into which the axes were rotated have been indicated above. In the end 
preference was given to the method which projects orthogonally on to oblique primary 
axes. | 

3. It is claimed that the resulting factors and their inter-correlations may provide 
a framework within which the study of the individual may be set. 
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THE SIGNIFICANCE OF FACTOR LOADINGS 
LAWLEY'S TEST EXAMINED BY ARTIFICIAL SAMPLES 


By STEN HENRYSSON 
Department of Psychology and Education, University of Uppsala, Sweden 


І. Introduction. II. The Construction of Artificial Samples. III. Conclusion. 


I. INTRODUCTION 


One of the weak points in factor analysis is the lack of satisfactory significance 
tests 1 for deciding whether the residuals left in the correlation matrix after extracting 
a certain number of factors can be explained by sampling errors or not. A step 
towards a solution of this problem was taken by Lawley (1940, 1941, 1943). Starting 
(1940, p. 65) from Wishart’s joint probability distribution of the sample variances 
and covariances,” he finds a likelihood function, which is treated as a function of the 
population parameters (the factor loadings) and for this he endeavours to find a 


practicable solution. 

His analysis starts with a preliminary estimate of the number, m, of common factors, 
and loadings for these factors are then found by an iterative process, beginning with a set 
of trial values. А method, suitable for large samples, is suggested for testing the assumption 
initially made in regard to the precise number of factors required. From his likelihood 
function he deduces that, if the assumption is correct, then the quantity 


2 


EN . 

у E [ec m „,у—1,2,3,...,® а) 
2 ss 

(where N is the number of observations, ci; аге the cova 

aij are the given covariances, and s;? are the error vari 

P degrees of freedom, where 


ariances calculated from the loadings, 
ances) will be distributed as y? with 


p=t{a- т)? —(n—-m). (2) 

“If a significantly high value of y? is obtained, this will indicate that we must reject the 
hypothesis and assume the existence of more than т factors E (1941, p. 184). This test of 
significance is applicable only if the above method of estimation is used. 

The object of the following paper is to examine Lawley's test of significance by 
means of artificial samples. The test is intended to be used on large samples ; and 
our object is to find out if it works on samples of 200 observations. This will be done 
by ascertaining how the values of у?, which we get for our different- samples, are 


distributed. 


II. THE CONSTRUCTION OF ARTIFICIAL SAMPLES 

W 3) gives 25,000 numbers taken at random from a normally distributed 
е = 1). Тһе numbers are given in ten uncorrelated columns, 
ху... ‚Жүр We construct nine variables containing one general factor according 


to the factor pattern 
gy de pes, (E61, Doi (3) 


Zi = 
! See, however, the paper by Professor M. S. Bartlett in the last number of this Journal (III, 1950, 
PP. 77-85) on ° Tests of Significance in Factor Analysis,’ especially sect. II. 

* The method is not restricted to correlation coefficients. 
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by adding to the number in column (1) first, the number in column (2), secondly, the 
number in column (3), and so on until we reach column (10). The value in column (1) 
is thus common to all the nine variables. We divide the first 24,000 random numbers 
into twelve parts with 200 rows in each, and thus get twelve samples of nine variables 
each containing 200 observations : 


Zin = XIN + xq 1)N (N = 1 2, e.s 200) (4) 


In each sample of 200 observations the variances and covariances of the variates 
2; are given by the formula 


1 1 
аў = 19g [> — y C» e| (575: 122. 2.510), (5 


Where 2: here denotes summation over the sample. Thecalculations may be simplified 
by using equation (3). Thus we have 


1 1 
йй = тос [>= + By; + ®хуху + Exo — 200 (x, + Ex) (Ex, + zx) | . © 


The sums in the equation above (which are sums over 200 numbers) may be obtained 
by adding together sums over four successive groups of 50 numbers. These latter 
have been calculated by Wold (1948) and Azorin and-Wold (1950). For the first 
sample of 200 the matrix of variances and covariances 1 was found to be as follows : 


. TABLE L MATRIX OF VARIANCES AND COVARIANCES 
و ت‎ 


Variable (i) (ii) Gi) (iv) (v) (ài) (vi) (vii) 0) 

G) $894 1819 2192 1957 1878 1987 1845 2091 1:959 
(ii) 17819 3833 1:829 168 1:757 1794 1.500 2001 1562 
(iii) 2192 189 4136 2291 1:779 2066 1393 2406 18%6 
(iv) 1257 1688 2291 3948 2069 1958 1.787 1957 1:835 
(у) Г878 1757 1779 2069 3885 1867 176 1:864 2:012 
(vi) 1287 1794 2066 1968 1867 3739 1917 2003 1815 
(vii) 154 1:503 1793 1787 1716 1017 3:358 1:608 1:835 
(viii) 2091 1769 2406 1957 1864 20 1-608 3:993 1:907 
(ix) 1:959 1:562 1:896 1:835 2:012 1:815 1:835 1:907 3:417 

Loadings 1438 1230 1495 1419 1354 1:343 


1:402 1:264 1428 


Assuming one general factor, the loadin 
method were as shown in the last line of the table. The value of у? required to test 
1-8, which, with 27 degrees of freedom, is not significant. 

à value would be exceeded by chance is 0-74. For the 
* and the corresponding values of P are as 


TABLE П. VALUES OBTAINED FOR CHI-SQUARED 


Sample | 1 2 3 4 5 6 1 8 9 10 " 12 


178 316 319 282 221 258 155 
092 025 024 040 073 053 096 


218 386 242 192 281 


P 074 007 062 086 0.41 


! When calculating the variances and covarianc i ivi В 
to make this step easier. This operation has Bou one [^a Mes огу: Brant НЫЗ 
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III. CONCLUSION 


The results obtained are in good agreement with theory. The values of P should 
conform with a rectangular distribution ; and the expected range is 0-85 (Cramér, 
1946, p. 372), a figure which is close to the observed range (0-96 — 0-07) of 0-89. If 
all twelve samples are combined, a total value for у? of 304-8 is reached. This, with 
12 x 27 = 324 degrees of freedom, corresponds to a probability P of 0-77. From 
the test we should therefore conclude that all the 9 x 9 covariance matrices can be 
expressed in terms of one significant common factor only, with nine specific factors : 
and this conclusion accords with what in these artificial cases we know to be the fact. 
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APPENDIX 
By CYRIL BURT 


Maximum Likelihood Loadings as Obtained from Single-Factor Correlation 
: Tables 


Mr. Sten Henrysson, in his suggestive paper, has applied Lawley's test of significance 
to a set of factor loadings calculated by his first * method of maximum likelihood’ from an 
artificial covariance matrix containing one common factor only. In this country factor 
analysis has been most frequently carried out, not with covariances, but with correlations, 
Accordingly, as many readers of this Journal may be interested in that form of application, 
I have ventured to convert Mr. Henrysson’s matrix of covariances * into a matrix of correla- 
tions. The coefficients so obtained are set out in Table III, with the corresponding factor- 


Saturations added at the foot. 

A word or two is perhaps required to indicate how British psychologists have commonly 
approached such a problem. In early discussions on the statistical validation of mental tests, interest 
centred almost entirely on the * general ability ° (or intelligence ), which seemed to dominate all 
Others. Accordingly in 1915 I ventured to put forward a simplified procedure for analysing correla- 
tion tables which contained no more than a single significant factor, i.e., tables of the type which 

r. Henrysson has here constructed. In such cases it seemed clear that, for the purposes of the 
Psychologist, Pearson's method of obtaining * the line of closest fit by the method of least squares 
needed modification. When we are dealing not with physical but with psychological measurements 
(the results of different intelligence tests, for example), we are almost bound to assume that a unique or 
"Ish i ank essor Wold for his help and encouragement in this research, which was 
Me spia poc en Statistics in the University of Uppsala, and Dr. D. N. Lawley and 
the editors, Professor Thomson and Professor Burt, for their assistance in preparing my manuscript 

Or publication. ы 4 "m 

> Mr. . 160) “when calculating the variances and covariances, we divide 

by LOD eee ороор. à Mes to obtain variances and суа A TRA the 

100/ od al procedure, his figures would have to be halved or (more stric y) to be р y 
9. 
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specific factor is present in each test. The larger the specific factor, the more it tends to reduce what 
(in a somewhat broad sense) was originally called the * reliability ° or * precision ' of the test. From 
this it was argued that, in accordance with the usual least squares procedure, the correlation table 
should in theory first be * prepared ’ by dividing each standardized variable throughout by the square 
root of its unreliability. And the argument was later extended to tables in which several common 
factors might be involved. In the case of an ideal single-factor matrix it was easy to show that this 
theoretical correction led to the same figures for the factor-saturations as would be obtained by 
applying Pearson's procedure direct to the correlation table with reduced self-correlations in the 
principal diagonal instead of unity ; and further that, in such a case, this procedure itself could be 
further simplified by substituting the formula for simple summation for the more elaborate calculations 
required by Pearson's method of principal axes. It was therefore suggested that, even in dealing 
with an empirical table where the observed coefficients did not differ appreciably from a perfect 
hierarchical arrangement, the same shortened procedure might legitimately be adopted." 

As applied to the problem of fitting a table of correlations the method of simple summation was 
identical with that already in use for fitting a contingency table (1, p. 64, eq. 1) ; and it was natural 
to suggest that tbe discrepancies or ‘ residuals ’ should be tested for significance in the same familiar 
fashion, ramely, by the chi-squared test (1, p. 64, eq. 3). On the same grounds as before, however, 
it appeared desirable to retain the correction for specificity: for the residuals before testing their 
significance in this way. This resulted in a formula for residual correlations which was identical 
in form with the Pearson-Yule formula for partial correlation : (cf. 2, p. 57). The similarity, of 
course, is not accidental : precisely the same principles are involved in the two arguments. 

The chi-squared test furnishes a significance test for the system of residuals taken as а whole, 
and was thus used chiefly with a bipolar analysis. To locate significant residuals occurring either In 
isolation or in clusters, it is more instructive to test each separately by an appropriate probable or 
standard error ; this method was accordingly used in group factor analyses. Since each augmented 
residual (or specific correlation,’ as it was called) could be regarded as a partial correlation with the 
general factor eliminated, it seemed legitimate (with large samples) to test it by taking the ordinary 
standard error for a partial correlation of zero, that is, by taking 1/4/ М, where N denotes the number 
of persons tested : (1, p. 352, eq. 15). This implies that the augmented residuals can be treated as 
conforming approximately to a àormal distribution with a standard deviation of 1/4/N ; and, on 
Г eae the sum of their Squares, multiplied by N, would conform to the chi-squared 
boris QUE n. us, without any very rigorous proof, it was assumed, and the assumption seem 

in practice, that the two procedures would yield much the same results. 


The object of this note is to show that, in cases like that selected by Mr. Henrysson, the 
results obtained by this simple procedure * are identical with those obtained by the 1016 
elaborate method of calculation which he has adopted in his paper. ` 


1 i 8 " А А 
This procedure was first briefly outlined in a paper, submitted to the Psychological Subsection of the 


British Association, on * General and Specifi i i ions’: (the 
сше ч Аш ecific Fact Primary E s': (th 
summary, as printed in Brit. Ass. Ann. р: ors Underlying the Primary Emotion 


: Rep., 1915, pp. 694-6, gives the figures so obtained, and 
a halter ЕЕ concrete results, but does not describe the technicalities of the method in any detal > 
Subjects in the School Сит ul men Report on * The Relations between Ability in Different 

е culum i i : р гал olations 
of Educational ‘Abilities, 1917). » reprinted, with some abridgment, in The Distribution and Relati 

t is perhaps necessary to remind those who refer t. i has 
8 Н Я У o these earlier papers that the use of terms | a 
prende ud changed. In conformity with Galton's terminology, the phrases ‘ specific abilities," 
be called * uL P Specific correlations ’ were originally employed to refer to what would now 
Andi pupils NO AERA f Me phrase * specific factor, however, was later adopted by Spearman 
factor’; and“ reliability " actor peculiar to a single test, i.e., what is now sometimes called a * uniqu 


ule was used to mt z HORAE | ا‎ ^ oy ance what was 
originally called * correction fo ean, not" precision, but * self-consistency.’ Hence what W 


ons with unity in the diagonal, but RR Ru) R= 
al matrix of specific factor variances, and Re=R— Гы 
ШЫЙ d ations. This formulation leads to equations of t 
ЈК, — val) = 0 and | А, i 
a— vl | =0; [0] 
but the formula now relates to correlations corrected for * unreliability. And the pro- 


cedure requires, not Spearman's correction formula у „ but the complementary formula, 
Fa rary 

wher iabilitv* ; ili‏ ا 
Vi-ra у’ € raand ry denote ‘ reliability ’ interpreted as above (viz., ‘ reliability ' SO‏ 
far as the tests i and j depend on common factors only).‏ 
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TABLE III. THE ‘OBSERVED’ CORRELATIONS 


Variable (i) (ii) (iii) (iv) (у) Gi) (vii) (viii) (ix) 
[0] (5314) 4708 -5462 -4991 -4828 -5207 -5102 -5303 5370 
(ii) -4708 (3950) -4594 4339 4553 -4739 -4189 -4522 4316 
(ш) +5462 -4594 (5392) 5670 4438 -5254 4811 -5920 5043 
(iv) 4991 4339 -5670 (5102) -5283 :5122 -4908 -4929 -4996 
(v) 4828 -4553 4438 -5283 (4736) -4899 4751 4733 -5522 
(vi) -5207 4739 5254 5122 4899 (5277) -5410 5184 5078 
(vii) -5102 41899  -4811 -4908 4751 5410 (4749) 4391 5417 
(viii) -5303 4522 -5920 -4929 4733 5184 -4391 (5085) 5163 
(ix) +5370 4316 — :5043 -4996 -5522 5078 5417 -5163 (5288) 
Simple | 
Summation -7302 6285 +7335 :7140 -6887 -7268 -6886 7123 ‘7273 
Weighted 
Summation :7290 :6285 +7343 77143 :6882 -7264 -6891 77131 77272 
Loadings : 
from 
А Correlations |1438 1:230 1493 1419 1:356 1:404 1:263 1:425 1:344 
тот 
Covariances | 1-438 1:230 1:495 1419 1:354 1402 1264 14% 1:343 


In the present case the working is as follows. Starting With the ordinary table of correla- 
tions (Table III), saturations have first been computed by * simple summation ° (last line but 
three in the table). These have then been taken as a basis for obtaining the improved values 
furnished by ‘ weighted summation’ (last line but two). It may be noted that, although 
the matrix is not perfectly hierarchical, the use of weighting introduces hardly any change in 
the figures. Each saturation has then been multiplied by the square root of the variance 
as given in Mr. Henrysson’s table. Thus for variable (i) we obtain :7290 x 4/3894 = 1:438, 
and so on. It will be seen that (except for trifling discrepancies) the loadings thus finally 


reached are identical with those obtained by Mr. Henrysson from his matrix of variances and 
covariances (see last two lines of Table Ш). 


ues for the saturations, a * hierarchical ’ matrix of correla- 


Next, taking the improved val : f 
5 р sual way to fit the observed correlations. The differences 


tions has been constructed in the u 


Now, if Re is a matrix of rank one, the ordinary method of expanding the characteristic determi- 
, 


nant leads at once to 
0 = | R- vel | = ve" —(ru t res oe da) Very. 

z; and similarly | Ra — Ye 1| leads to va = 0 or va = 
n Rob fü Rut. Hence DE may ш. 5 - Ro and 
Я à ti these conditions, the results of factorizing 
va = f/ Réf, From this it ts easy to show that, under ions, th 

R. wilt satisfy equations (0. Substituting the values thus found for fo’ and Ya, we have 


fai Ra = féR - Rit Re Rut = fé Ru fe. fe Rut! 
Bf RO fife = vafa: 


Ye o£ - ired. Thus, when the reduced correlation matrix is hierarchical 
а is, fe’ (Ra — Val) = 0, as required. Thus, ions entailed by equations (i): the observed 
eee is no need to сап, xcept that reduced self-correlations must 
free Eon AUI] fOr unity. Since in such a case rü — rij, the expression for the residuals 
Becomes el ПН». (Note that, if Re involves several common factors, Fe Ru" F, is 

— — n т CEDE 1 
not à PEE Eo em thus does not satisfy the * orthogonality condition’ in the form imposed 
‚а 
by Lawley.) 


which gives ve = 0 or Y 


Af But Ri = Re 
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i in Tz i * residuals °) 
between these theoretical figures and the observed figures in Table III (i.e., the first residu: 
i hen been divided by the product of the square roots of the two relevant specific factor 
variances. Thus 


:4708 — -7290 x -6285 7 0238 
nes = V1 — 12905 VÎ — 6285; id 
These * corrected residuals ' are shown in Table IV. 


TABLE IV. THE ‘ SPECIFIC’ CORRELATIONS С m 


Variable (i) (ii) Gii) (iv) (v) (i) — (vi) (vii) (ix) 
(i) = 70238 -0232 —-0449 —.0376 — 0183 —:0157 -0215 0149 
Gi) (0238 =—  —:0042 —-0276 0405 0327 —-0254 -0070 —-0476 
Gii) 0232 —0042 — 0892-1249 —-0172 —-0510 -1430 -0637 
(iv) —0449 — 0276  .0892  — 0727 — 0135 -0028 — 0341 0431 
(v) —:0376 0405 -1249 77  — —.0194 -0017 — -0342 10 
(vi) —0183 0327 —-0172 —-0135 —0194  — 0812 0009 E 
(vii) —:0157 —-0254 —-0510  -0028  -0017 02  — —-1035 E 2 

(viii) 0215 — -0070 -1430 ~-0341 —-0342 -0009 —1035 —  —-00: 
(ix) 70149 —-0476 0637 0431 -1044 — 0428 -0816 — 0054 — 


— ا د‎ Ro са 


With 200 cases the standard error for a partial correlation of zero may be taken as 
1/4/200 = 0:0707. Of the 36 fi 


figures in Table IV, only one is more than twice the standard 
error, which is in agreement with Mr. Henrysson’s value for P (0-74). Chi-squared can be 
calculated by squaring each of the * corrected residuals,’ adding the squares, and multiplying 
by N = 200. The figure so obtained is 22-0 as compared with Mr. Henrysson’s figure cd 
21-8. Thus chi-squared for the residual correlations is practically identical with chi-square! 
for the residual covariances, as was after all to be expected. (Неге, and in the foregoing 
figures for the loadings, the slight discrepancies between his figures and my own are no larger 
than might be anticipated from a difference in the extent to which the iterative calculations 
have been carried. Indeed, had we dispensed with iterations altogether and merely taken 
the figures given by simple summation, the result would have been virtually the same.) | 
Tables III and IV above Corres; i i а nd Tables 
XXILand XXI o Mies pond respectively with Tables XVIII and XX a 


saturations there given were di 


ordinary reliability coefficients 
The use of reduced self-correlations was stron itici i Pearson 
| 1 gly criticized at the time by both Pe 
and Spearman, and has since again been questioned in the last issue of this Journal bY 
reee E ed pu it may, I think, be claimed that the agreement between e ren 
1 stitution and those furnished 3 <i likelihoo 
can be plausibly adduced as on а cds LINT 


Е е further argument in f; f-correlations, at 
any rate in such cases as the pretent. gu avour of reduced sel 


; һе 
н Mr. Henrysson ga equal weight to t 
с factor in every case, Hen pected GRE foe 


0:035. At first sight the si 
that a rough estimate of the st: 


^ In Human Ability (1950) Spearman repeats his criticism of “ i iagonal places 
by stopgaps," which he thinks “ quite arbitrary (pp. 29.30) pev rie MEL Lc see Brit. 
J. Psychol., Stat. Sect., Ш, 1950, р. 126, and refs, = ы 
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ments by Mr. H. F. Weston and myself some time ago indicate that such an estimate is decidedly too 
low. The approximate formula I myself have suggested (Mental and Scholastic Tests, p. 139) here 
gives a value of --0:058. But further investigations along these lines would be highly instructive.* 
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1 In a study of significance tests Mr. Weston has calculated artificial tables using different proportions 
for the general factor in each test, and including in some cases a second bipolar factor. Here too the 
figures obtained by applying the least squares procedure (* weighted summation ’) to the correlation 
table, with reduced self-correlations in the diagonal, prove to be very similar to the figures reached 
by the more laborious procedure described by Lawley and Emmett (this Journal, 1I, pp. 90-7 ; cf. 
Burt, ibid., pp. 106, 113). With the chi-squared tests he finds the ‘correction for specificity’ rather 
more generous than the z-transformation. It would be instructive if Mr. Henrysson could give 
detailed figures for the loadings in all his twelve tables. At the same time it should be remembered 
(as Dr. Lawley points out in a note he has been good enough to send me) that the errors of estimation 
in the nine loadings would not be uncorrelated. He adds that according to his own formula the 


standard error should be about +0063. 


o 
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THE FACTORIAL ANALYSIS OF QUALITATIVE DATA 


By CYRIL BURT 
Psychological Department, University College, London 


I. The Importance of Qualitative Data in Psychology. ЇЇ. Alternative Statistical 
Techniques. Ш. The Treatment of Multiple Determinates. IV. А Factorial Analysis 
of Physical Attributes. V. Summary and Conclusions. 


I. THE IMPORTANCE OF QUALITATIVE 
DATA IN PSYCHOLOGY 


The Form of the Data. In many investigations within the field of individual 
differences, the available data are expressed, not as quantitative measurements stating 
magnitude or degree, but in terms of classes or attributes which are essentially 
qualitative. Statistical psychologists, and particularly those who have worked with 
standardized tests and employed factorial methods, are often accused of ignoring 
the qualitative aspects of their problems. As a rule, their critics seem to assume 
that, because such observations are not recorded in the form of quantitative assess- 
ments, they are no longer amenable to quantitative treatment, and can therefore have 
no use or interest for the statistical investigator. That, however, is a patent fallacy- 
If they are to be tabulated, such Observations, it is true, must be set down in what the 
schoolboy calls the * nought-and-one’ style, where each * one? will signify that yet 
another individual belongs to the class named or possesses the attribute specified. 
and ‘nought’ will signify that he does not. But, when that has been done, it is 4 
simple matter to count the ones and compare their sum with that of the ones and 
noughts added together: in this way we can readily summarize our observations In 


the form ofa table of frequencies or probabilities ; and these can manifestly be 
subjected to statistical treatment : (cf. 3, 4, 5). 

‚Їп psychology the oldest and most familiar instance of this procedure is furnished by the marks 
| ge о, P Binet Simon tests. Here each examinee is in effect awarded a measurement of ' ond 
а Ies е EE. and ‘nought’ for every failure. A similar device has long been adopte 
p cache in marking the simpler type of examination paper: each child's total score is obtaine 
S Bs ang every correct answer as * опе,’ and adding up the total. So-called personality-tests 

d е Rorschach, the personal inventory, the biographical questionnaire, and enquiries about interests 
an Aer for example—are frequently scored in this way. 
Dreh. fie ера no doubt, most mental traits (probably as a result of their highly complex 
M dera у Heus of continuously graded variables. But the patterns, the combinations, s 
reduce them) oft E ve Such traits (or between the statistical factors to which it is convenient 
Шой Me m à qualitative character, which cannot itself be expressed as a difference tis 
pce mule in voua psychology is not concerned merely with the description of individua 
aad his cbe 198: It has to take into account the causal influences which affect each individua, 
qüntitative СЫМ КЕР уы ; and many of these causal influences аге not themselves reducible | 
take up pointe which lesd or example, in my early studies of delinquency, it was often essential o 
Is he an only child ? г ie a classification rather than a measurement. Is the child illegitimate 5 
What is the lad's гезе " e Ing with his parents? Has he suffered from this or that disease * 
Genre on ordina eale, otonly Dy a cafe, bed and many other questions cannot be answered MO 
by n this and е not that or duy oiher шон an cine cin 

or are qualitative data of this nature confined solely t cur in many 

Ped Ede ie Physical anthropology, for example, the peo aser ИАДА to deal with 

Know paper on he cote DEC bules, s contrasted with "variables. Pearson s well, 

need to deal with qualitative traits Epp ur e см prompted DY fier 
H E 'e-colour. as a a 

Срез of mule PE PoE (6) will show, some of be stock ае or E procedures are 

s ‘awn from the anthropological field. More recently in biological research, and particularly 
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, The distinction between * measurement-data ' and ‘ frequency-data ' and between * continuous 
distributions ' and * discrete distributions ’ has long been recognized in statistics. And the growing 
realization that mathematics is not restricted to the study of quantity has made it increasingly possible 
to discuss qualitative characteristics in the same rigorous way, and often along the same lines, as 
quantitative. On the wider issues I may perhaps refer the more advanced student to the contributions 
of French statisticians, who have dealt most fully with the problem.* 


Now we must, I think, agree with our critics that, in studying individual differ- 
ences whether in physical or in mental characteristics, factorial investigations have 
hitherto concentrated chiefly on quantitative measurements. It would, however, 
be plainly advantageous if all kinds of characteristics, qualitative as well as quantita- 
tive, could be included in such studies. And the chief purpose of this paper will be 
to indicate how factorial methods may still be applied, even when much of the data 
is expressed in a qualitative form. 

The Nature of the Factors. Аѕ with quantitative variables, so with qualitative 
attributes the factors thus derived will be essentially principles of classification. 
Indeed, the factor analysis of attributes is related, much more obviously than the 
factor analysis of variables, to the modes of classification employed in the simpler 


Sciences. 

, In botany, for example, the trained observer inspects in order the parts of the plant he is seeking 
to identify—its root, its stem, its leaf, its flower, and its fruit ; and knows that each of these parts тау, 
exhibit one of a number of alternative forms or qualities. The veining of the leaf may be parallel 
or netted ; the colour of the flower may be white, red, yellow, blue ; and so on. Even where some 
of the features have numbers attached, e.g., 3, 4, or 5 petals, these numbers still specify, not differences 
M degree, but differences in kind. The diagnostic combinations of such attributes correspond to the 

actors. 
The chief divergence between problems of classification in the Simpler sciences and in the anthro- 
pological sciences is that in the former a far higher degree of certainty attaches to the inferences to be 
drawn. Thus all tulips have parallel veins ; nearly all hawthorns have flowers that are white or 
red; and, with few exceptions, all clover-leaves are threz-lobed. In chemistry exceptions are 
rarer still: all metals of the alkali class are univalent ; all occur in the first group of the periodic 
table ; all act as powerful bases ; all are strongly electro-posilive, and the strength of this tendency 
almost invariably increases with increasing atomic weight (here, we may note in passing, the 
classification turns in part on differences in quantitative va bles as well as on qualitative attributes). 
On the other hand, we do not find that all children who fail in the Binet tests for Age VIII also fail 
in those for Age IX ; nor do we find that all leptomorphic individuals are schizothymic or that pyknic 
individuals are nearly always cyclothymic. A correlation technique therefore becomes essential. 
With qualitative variables as with quantitative, the most useful factors will be 
those that lead to a hierarchical classification ; and the natural procedure will be 
to seek, in any particular * domain ’ or * universe of discourse," first the factor which 
will account for the largest amount of individual variation, and then the factor which 
will account for the largest amount of residual variation, and so on. Thus, to adopt 
an expressive term familiar to those brought up on. Aristotelian logic, a factor 
may be thought as а Siagood єйзоттобс ?—a hypothetical differentiating principle, 


1E.g. ibuti different writers in Borel, E. (ed.) Traité du Calcul des Probabilités (1924 
ане poet on the fact that, whereas earlier writers who touched upon the statistical 
treatment of psychological problems classified * statistical series ’ into qualitative and quantitative 
more recent writers (like Garrett, Thurstone, Engelhart, McNemar) have preferred to classify series 
into ‘ discrete’ and * continuous,’ and that, as they use the terms, the distinctions are by no means 
identical. Quantitative scales may also be discrete ; and, solan Be ma ШЫГ [xo end 
distributions, they restrict themselves almost entirely to quant ЕТ, n ын s vs DR ai ave that 


ing, let me emphasize at once that 1 r e 
S can be adequately dealt with by just tabulating frequencies. In 
it-assessment, whether qualitative or quantitative, should always be supple- 


my view th 0 trait f 1 
mented by he e ly Neither is complete without the other. The ancient controversy 
between ‘clinical’ and ‘ metrical " psychologists has eth large y. bc a io ды 
assumed that his own method was the sole method. How the two in E eh may bo com 
bined in a singl n d to show in my book on The Young Delinquent ou 
arch I endeavoured. 2 3 З 
hold that аа Every problem of individual psychology demands this double mode of treatment. - 


? Arist., Top. VI, vi, 2; cf. Nicom. Eth., X, iv, 3. 


To avoid misunderstan' 
all the psychologist’s observa 
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constituting a species.’ Remembering that ‘species’ is а relative Teme E p 
therefore expect the widest factor to represent the genus under me mn: aed 
the remaining factors to represent the differences between species, su sse m 
So on, in hierarchical order. However, as modern logicians have so often poin d 
the traditional notion of classification found in the older text-books of logic assun ЫП 
a neatly arranged system of natural kinds—each genus sharply distinguished о 
other genera, each species or type unequivocally separated from the rest ; hu this 
in actual fact there is a vast and often irregular amount of overlapping, an ul 

overlapping is greatest in the more complex sciences. Hence, in my view, Ше ny 
safe and speedy way to reach a satisfactory solution in such cases is to invoke Ше! 
of a suitable statistical technique. And, if the statistician has not as yet develope 


appropriate method which he can offer us, then the psychologist, however imperfectly, 
must set to work to devise his own.! 


II. ALTERNATIVE STATISTICAL TECHNIQUES 


Correlation in the Case of Two Dichotomous Classifications. In my first aor r 
factorizing results obtained with the Binet scale, each test was paired with every other, ‘the 
the number of children passing both was taken as indicating the association between kid 
abilities so tested. In view of the large number of tests in the scale, the frequencies for ES 
Several pairs were entered in a single table of double entry, like an ordinary contingen 4 
table : the whole table was then fitted with theoretical values by means of the simple sum 
tion formula ; and the sums were taken as indicating the degree to which each test. s 
Tepresentative of the common factor. In later researches, when the factor айа се 
correlation coefficients had become a recognized procedure, attempts were made to TC es 
the initial data to correlational form. Since cognitive abilities are continuous ede т) 
the ideal procedure, so it was thought, was to modify the Binet problems, wherever ро late 
in order to secure graded measurements, and, where this was not possible, to calcu 
2 The logical difficulties surrounding attempts at classification in the more complex pone ае 
perhaps most clearly brought out by Sigwart's discussion ; and his solution is in keeping wit SVG 
Procedure suggested above. By what principle, he asks, are we to be guided in our “ DIOE ОП 
efforts to work by ascending generalization from the lower to the higher classes, or downwards that 
the higher to the lower ” ? At each stage, he replies, ** the most serviceable arrangement will be m^ 
which groups together those things which have the largest number of characteristics in sammo a 
(Logic, Part III, * Technical ^ 5 esp. рр. 158f. : it is suggestive to note that he proceeds to cite, ал ers 
cases showing the most difficult and complicated Structure, the attributes or * elementary charac 
istics* studied in psychology). 


But how (he goes on to ask) are we to compile our preliminary list of characteristics in order о 
See what is the largest number that certain groups have in common ? That, as he points ou rial 
plainly not a formal but a material problem. To which I need only add that, if it is a mate 
problem, then itis not a problem that can be decided by statistical calculation alone. ents, 

„There is, I am well aware, a widespread notion that, in dealing with quantitative measurem К 
the investigator can proceed by first taking all the tests that are applicable or throwing togethe 


> > Н ” in 
data that may be available, trusting that the Statistical analysis will then “ discover " the underlying 
at, I venture to assert, is q 


) , all such prior considerations wil] 
analysis, but on concrete ob: ative hypotheses. 


5 а 
2 Cf. Yule, U., loc. cit., p. 64, eq. 1, and p. 66, Table III. Ttw: s in fact this formula which 50020516 
thë use of * simple summation ' and the application ofthe ЕНА est Bom eda for correlation 
variables in the ordinary way. 
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tetrachoric coefficients, on the ground that the distribution of such abilities conforms pretty 
closely with the normal curve. 


However, as I have already pointed out, with much of the data obtained in work on 
non-cognitive differences, the information may be essentially qualitative in nature and 
available only in classificatory shape. In such cases the tetrachoric coefficient is quite unsuit- 
able. Nevertheless, without making any specific assumptions about the nature or distribution 
of the underlying variables, it is still possible to express the association between such charac- 
teristics by means of correlation. Thus, to obtain what in form at least is a product- 
moment coefficient, Yule ? suggested taking the line of division between each pair of classes 
as the arbitrary origin and the class interval as unit, and then calculating the mean, the 
standard deviations, and the product-sum in the usual way. Using the notation I have 
adopted in previous articles," suppose that, in a population of N = No + Nj, a given attribute 


is present in N, cases and absent in No cases ; then the mean will be 3s (№ — №) = N » 


ius / N.Ni e РЕР 
and the standard deviation VM This is tantamount to assigning a value of 


М, 


Es Ne (the mean of the positive subgroup) to denote the presence of the attribute, and — N 
(the mean of the other subgroup) to denote its absence. We can now compute the frequencies 
for joint deviations ; and, with this or some equivalent device, we can always calculate a 
complete set of correlation coefficients between all possible pairs from a series of attributes. 
Then, if we wish, the table of coefficients can be factorized, and the attributes expressed in 
terms of the factors so obtained. This method—namely, the calculation. of coefficients 
from all the 2 x 2 tables formed from the initial tables of frequencies—was adopted for a 
factorial study of the Binet tests, and appeared to give quite satisfactory results.* 


Available Procedures. Evidently, then, there is no intrinsic difficulty about factorizing 
data of this kind. According to the problem at issue or the assumptions we prefer to make, 
various working methods may be adopted. From the standpoint of the research-student, 
perhaps the simplest way to classify the possible procedures is to begin with the familiar 


distinction between (A) methods appropriate to cases where measurement is at least con- 


ceivable and (B) methods appropriate to cases where it is more natural to think in terms of 


frequencies. 


(A) Methods of the first type proceed by converting the initial data, explicitly or implicitly, into 
quantitative form, and then base the factor analysis on à table of correlations or of some equivalent 
type of coefficient. For the present purpose the most useful coefficients are the following : 

(i) the product-moment correlation for a twofold point distribution (Ф) ; 
(ii) Yule's colligation coefficient (Ф for an equalized table) ; 
(iii) Yule’s association coefficient ; 
(iv) Pearson’s coefficient of contingency, 
Tschuprow’s formula ; 
(v) Pearson's tetrachoric coefficient ; 
(vi) the correlation ratio. 


corrected if necessary for degrees of freedom by 


Analysis of the Terman Bint Snie Brit. J. Eduegi, 
XII, 1942, p. 122. the relative merits of different methods of calculating association coefficients 
for such data as а Саодат by the Binet tests, cf. Mental and. Scholastic Tests, pp. 137 and 232, 
also The Backward Child, Appendix Ш. ] 1 
‘The Mea mene of Jniellipenee by the Binet Tests,’ Eugenics Review, IV, 1914, pp. 140f. 

* Yule, U., Introduction to the Theory of Statistics, 1912, pp. 216-7. : 

? E.g., Burt, C., ‘ Statistical Problems in the Evaluation of Army Tests,’ Psychometrika, IX, 1944, 


р. 225, and refs. 


* Cf. Mental and Scholastic Tests, p. 137. Аз I noted at the t e ps г 
Coefficient or point-distribution correlation commonly introduces “an additional factor which 


Seems to indicate the relative difficulty of the various tests * (loc. cit., font T d deu. asa 
rule, does not appear when tetrachoric or ordinary product-moment coefficients are used, because 


they eliminate differences in the point of division. bd 


ime, however, the use of an association 
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(B) Methods of the second type leave the data in qualitative form. In dealing with such 
material we should, I think, frankly recognize that factor analysis may be legitimately applied to any 
kind of symmetrical table, not merely to correlations or covariances. In this case the figures:to be 
factorized will mostly take one or other of the following forms : 

(i) the absolute frequencies ; 
(ii) the relative frequencies or probabilities ; 
(iii) some other ratio, e.g., the square contingency or its derivatives. 


To a large extent the distinctions on which the above classification rests are distinctions 
of procedure rather than of principle. Both the contingency coefficient and the root-mean- 
Square contingency (Ф) are closely related to the square contingency (у?) and to the frequencies 


the Rd it is based ; and thus the two main types of procedure tend to merge the one into 
the other. 


ПІ. THE TREATMENT OF MULTIPLE 
= DETERMINATES 


,,, The General Table of Observed Data. With the Binet-Simon scale, and, indeed, 
with most of the qualitative tests which the psychologist is likely to factorize, the 
presence or absence of the * attribute yields in each case a dichotomous or* bipolar 
Classification of the individuals observed. For the majority of such cases I have 

` suggested that the c-coefficient can be regarded as the simplest and most convenient 
procedure. ! However, in constructing contingency-tables, it is often essential tO 
consider Cases in which the classification is (to preserve Yule's terminology) not 
twofold,’ but * manifold. 2 Tables of this type, and particularly the compoun 
table, where there are not only more than two possibilities for each trait, but also 
more than two traits (a type of table which Yule barely mentions), have been much 
neglected in the past 3 andit is one of the chief objects of this paper to direct attention 
to the problems Which it raises. Evidently a twofold classification may always be 
regarded as a special and simple case of a manifold classification. Hence it will be 
more instructive to discuss the question in its most general form. 
» In E mM classification “ the individuals are first divided under mi heads, As » 
Ar n “An 3 thisc assification may then be compared with a number of other classifications, 
ivi ШЕ the same individuals into the classes Di, Bs, .. . By, and so on (6, р. 60). те 
ОШКО: cases I propose to introduce a terminology familiar to logicians, ап 
the ааа CET to which the classification relates (i.e., what scholastic writers {еш " 
Meter CIE v ДОШ) а determinable,” and the alternative qualities assigned to T 
called a * determinable, and the variou, gade 19ке ES example, hair-colour may "s 
‘red,’ ‘ dark '— the * d andt € various shades that hair-colour is recorded as taking— fa! o 
ý e determinate? Accordingly, the matrix of observed data which W 
1 The problem arose afresh i don wi А ists 
working in the Dicono Es eno with the analysis of anta Lupe. d уор the 
dase in which haces ET id emorandum which I was asked to etna f 1943. For the эрес 
Me рк: 2 eld simple dichotomous classifications I there suggested the proced Sa 
the resulting 2 x 2 tables in pla Substituting a point-distribution calculated 9 
of approach was suggested by Mr. P. Slat te or relative frequencies. A slightly differen tive 
paper оп * The Factor Analysis of 2 Ma CU HA has since described more fully ne suet oa, 
pp. 1 d ir. practi (топ theoretical objections, I am inclined tp Think that a Working procedure 
concerned (ebyen o "ua somewhat laborious, oreover, most of the traits with which he is 
imply graded characteristi emotional instability" and the like) seem to e n. 
Hence for these the tetra: did usen epptoximately norma ae 
Тир eem to be more appropriate. 


2 I have borrowed the terms from W. E. Johnson, Logic (1921), Pt. I, Chap. XI, p. 174 
>*t 1, Chap. XI, p. . 
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require to investigate will have the form ? shown in Table І. Let us call this matrix Mı, 
Since it corresponds to the non-standardized matrix of observed measurements in quantitative 
investigations. 
TABLE I. OBSERVED TABLE SHOWING PRESENCE OR ABSENCE OF 
ATTRIBUTES FOR A GIVEN SAMPLE OF PERSONS 
IEEE ddp SS ——— 


Persons (N) Total for 
Row 
Tom Dick Harry T George 
Determinables (m) 

Hair-colour 

Fair 1 0 0 0 N, 

Red... 0 1 0 0 Na 

Dark 0 0 1 1 Ns 
Subtotal .. T" 5 1 1 1 1 N 
Eye-colour 

Light te - 1 0 1 ai 0 N, 

Brown si n 0 1 0 1 Ns 
Subtotal .. - ue 1 1 1 5 1 N 
Total for Column s m m m sun m Nm 


It will be convenient to adopt the following notation : * 


Number of determinables T T T 4 
Number of determinate values in the Ist, 2nd, . - . jth, 
...mth determinable .. P an ae 

Total number of determinates .. els T n= n, t na +.. m 

Total number of persons for each determinate E Ny, Nas... №р 

Total number of persons in sample — .. T M N =N, + Nat ot Nw, 
Number of persons possessing both the jth and the kth 
determinates — .. ۴ ж» «s кз ВМ ik. 
Since for each determinable every person must possess one determinate value and none 
can possess more than one (e.g.. every person must have hair of some colour—either fair, 
dark, or red), the subtotal obtained by each person for each determinable must be 1. It 
follows that, if for the first determinable (say) the number of rows is 71, then, when we know 
the values for (n; — 1) of those rows and knowing the values for the row of totals (1), we can 
immediately deduce the value for the remaining row. Accordingly, assuming N > п, the 
rank of the matrix represented by Table I will be (т — D + (пъ — 1) +... + Gm — 1) +1 


=п—т-+ 1. и 
1 Since for terminable deduced from the preceding rows, 
We may, if Ss Biden, guit it from the subsequent calculations : that would evidently give rise to a 
slightly different form of analysis, but the modifications pasla will be үч) = 
of the determinables permits of two alternatives only (as in dealing wi ch 
* present’ ог“ Дн у with test-items that are marked only x р eae aatan dre 
only o inates the for r tl B 
Tet me. add e kc pre k with compound dh TP ү; 
infrequently the mere presence of one may have no diagnostic signifi- 
ance by themselves, but their joint prese h m 
girl, pallor by i has little importance, and increasing plumpness still less ; (о 
a Once [d ee шоч. Similarly in emotional and moral disturbances, it is the total 
syndrome that is of diagnostic importance, not t | n 
any such compound characteristic Should still be treated as a single attrib 
ie nai tables, I shall keep, so far as possible 
® Here a i 75 notation for contingency-tables, keep, 50 га > 
to that. айе арай os Pager adopted by most statistical psychologists in this country (cf. 


Kelley, T. L. Statistical Method, 1923, рр. 263). 


nm, 


пуу Ha ey ees ту 
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: iei ions for a principal 
i '(R— = 0, we must have f" R = vf’: i.e., iff’ denotes a row of saturations incip 
Pid is suceda matrix R, the row of weighted column sums obtained by раи 
the saturations by the matrix R must be proportional to the original row of satura ы ашыу, 
constant of proportionality must be /'/ = v, the factor-variance or ‘latent таан “окота 
consider the section of the ith column of А for the first determinable. This has the following 


On multiplying each row of R by the saturation coefficient, we obtain 


Nj _ й М, = Ny " 
VN, Nj N VN; N 
-Ni ا‎ £x cJ 
VN; N; N © УММ 

Ny (№ _ NS 
VN, № N VN, N 


Р > T es, 
Butthetotal numbershowing these determinate values is №; (i.e., if there are only three such valu 


m 
Ny + Ny + № = N). Hence the total of the column will be VAN - хі. If there аге | 
iate 
determinates, the total for each column, after every row has been weighted by the арры 
r 0! 
saturation coefficient, will be т JR. Dividing by m (the latent root), we obtain the saturati 


coefficient. The required condition is thus fulfilled. rminate 
„ Since the number of factors in R (i.e., the rank of R) is less than the number of de т. 
traits (i.e., the order of R), each of the remaining latent roots will have a value that is less tha 


n Л А е tota 
Hence the factor just extracted is the factor which accounts for the maximum amount of the tota 


variance, and may therefore be regarded as the first or general factor. 
From the values 


Б z тепаа ruct the 
n 1 given for the saturations by equation xii, we can now reconstruct U^ 
symmetrical hierarchy due to the first factor, namely, H = Sf’. We obtain the symme| 


matrix 
N, VMN, МММ» 
N N U N 
V NN, М» VNNn 
N N idi N 


: A nt 
We may write the general element in this matrix hi = УММ; . But the general eleme 


N 


А А s in 
in the standardized product-sum matrix was rij = ig . Hence the general element ! 
the residual matrix will be x 
rg — hj = ANI 
m (ws — :3É y 
NOW NiNj 


i ‹ " ‚єє expected 
N is the‘ expected value ’ of the frequency in the ijth subgroup. This i е2 
value ' is the frequency we should deduce for that subgroup if it consisted simply of individu 
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sampled at random from the general population represented by the N persons whom we 
have observed, and if there were no special correlation between the ith and the jth traits, due 
to some causal influence affecting those traits. Using a notation that has recently come 


into popular use ! let us write ММ; _ E for the expected frequency, and Ni = O for the 


N 
frequency actually observed. Then N (rij — hij)? = 207, and thus expresses the square 


of the discrepancy between the observed value and the expected value for any element in the 
product-sum matrix. Now, if such discrepancies were due solely to chance, we should expect the 


sum of their squares to bedistributed in accordance with chi-squared. In fact, y? = X 2—0) 


is the value we should calculate if ме wished to test the set of observed data to determine 
whether or not they represented the results of random sampling from the general population. 

The meaning of what we have called the saturations for the first factor is obvious. In 
factor analysis the first or * general ' factor describes the general quality defining the entire 
genus with which we are dealing. Из saturations express the relative importance of the 
several traits as manifestations of that generic factor. Here they are actually proportional 
to VN1, Ns» VNn» і.е., to the square root of the frequency with which the several 
traits occur in the population sampled. The factor-measurements for each individual can 
be calculated (if required) in accordance with the usual procedure. They will evidently show 
how far that person is typical of the general population represented by the sample. 

The factor-saturations for the remaining factors can be extracted in the usual way and 


tested by the chi-squared procedure. 

The Conditions of Consistency. Figures descriptive of one and the same popula- 
tion or sample must be mutually compatible, and therefore obey certain restrictive 
conditions. The precise formulation of such conditions has a double value: it 
enables us (i) to detect data that are inaccurate and (ii) to assign limits to the unknown 
values when the data are incomplete. Elsewhere I have ventured to suggest that, 
whether the traits concerned are qualitative attributes or quantitative variables,? the 
most general criterion is essentially the same, namely, that each of the principal minors 


of the symmetric table of product-sums shall be non-negative. 

In Laboratory Notes on Factor Analysis this principle was deduced for quantitative variables ; 
it remains to show that it also holds good for qualitative data. The arguments are much the same 
in either case. Let A; denote a symmetric submatrix, obtained from R (eq. iii) by striking out corre- 
sponding rows and columns for certain variables. Then | В| will be a principal minor of R. Now Ri, 
like R, can be factorized into its latent vectors and roots. Thus, we can write A; = ИД, and 
therefore | Ril = | Lill Vill Li] = Lil (since Li is orthogonal апа Li = 1). When the rank of 
Ri (ri, say) is equal to its order (nj), then V; — Fi' Fi and forms a diagonal matrix in which each 
element consists of the sum of the squares of the saturations. When 7; < ni, the extra 
elements in V; will be zero. Hence, whatever the rank of R, no element in V; can be negative ; 
therefore the value of | R: | is also non negative. gris clearly holds for any principal minor in R, 
and any that can be derived from C., Р Аду: . 

[py unnecessary to test every principal minor. By invoking the Binet-Cauchy 
theorem, it can readily be proved that, if the /eading principal minors of the 1st, 2nd, . je rth order 
are positive, then all the principal minors of those orders will be positives JOY the for ge are the 
minors which automatically appear, stage by stage, when we adopt what " ae called the * factorial 
method ? of calculating partial correlations by hierarchical subtraction. nd it is easy to show that, 
1 E.g., by McNemar (Psychological Statistics, 1949, pp. m 186), weder oai "P 
* It ma Р as іп an analysis of variance the averages ans) are not elimina то 
the very Sora ous they чеге devoid of interest, and as they would be in an ordinary correla- 
tional procedure. They are, in effect, tested a лоса 
"T а itativ i the working criterion : 2 le is that no 
Fu e peine case of quantitative Vae HT principles A ps 
partial correlation can exceed the numerical value of unity : aes ), Свара рро RUD 
and refs. ; also id., * Оп the Theory of Consistence of Logical Class Freq) Е Й AS 


CXCVII, 1901, pp. 917. , 

А due - i i i *: (a brief summary of the 

^ See La * The Solution of Simultaneous Equations : | 

method шогу Noles E MET be found in Vernon's Notes on Statistical Method, pp. 42f.). 
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The Unstandardized Product-sum Matrix. By post-multiplying the table of observed 
frequencies by its transpose, we obtain a product-sum matrix А 
С, = М; M. (i) 
Except for the fact that the values in M, are not standardized deviates, this pro 
matrix corresponds with the more familiar matrix of covariances (or rather codevia M. 
between traits. We can obtain an alternative product-sum matrix by pre-multiplying | of 
by its transpose, namely, Cp = M,'M;. The result will correspond to the ШИШ at 
covariances or codeviances between persons. This second type of matrix, however, WI 
concern us at this stage. E , table. 
The matrix of product-sums for traits constitutes a symmetrical contingency-tab'e. 


E А ` ; о 
Consider, for example, the calculation of the product-sum for a pair of determinates from tw 
different determinables, such as those set out below : 


Hair,fair(j) .. " L111291p9112001 01 t o 
Eyes, light (k) .. - lLrii011iio T1 H TFT 1 1 1 
Product. . i 27 2012110001 Oo OT OO G 1 tT 8 


3 its 
Evidently the product-sum, Nj; = 7, states the number of persons who possess both trait 


(a * second order ’ frequency) ; and this is the value that would be entered in one cell ae 
small contingency-table that “night be drawn up to show the relation between hair-colo 
and eye-colour. (For an illustration of the type of table eventually obtained, see р. 1 to 
However, as the size of the sample (N) will necessarily differ from one investigatio iye 
another, it will be better to express the data in terms not of the absolute but of the reinsi 
frequencies, e.g., pj, = Njx/N say. We can then go on to standardize the data by divi 


e say can the А jonate 
this figure by the standard deviation, VN N/N. In this way we get a set of proportion? 
values which we may designate 


- Tjk = ру Мрурь = Nisl VN;N, . w^ nal 
This is equivalent to pre- and post-multiplying the product-sum matrix by a diagon? 
matrix D-*, whose elemen N 


я ical 
"m uu ваа е may now proceed to factorize this standardized smart 
8 its latent roots and laten k is wi s to expres 
XC observed matris d t vectors. This will enable и 


Adopting 
1 n terms of a set of orth к=п —m-+1). ип 
the usual notation we shall have шыс ыы ы ш 


DiM,—M-Lyip = pp, Gv) 


where P will be an r x N section of an ort i Д - section of 
another orthogonal matrix GL = Г), "ihogonal matrix (EE DB ana 


and Vianr x r diagonal matrix. This gives 

| Ri = MM’ = LVip PVA’ = LVL’ , e 

or (on taking product-moments for persons rather than traits) 
Ro = ММ = PVL yip = ppp, (vi) xi 

! If all the principal minors of a matrix are non-negative. į zero, it is 521 
to be Positive definite (or semidefinite, if its rank is fess thon id a бип огеш it is symmetrica , 
Mdb obe задао di iege s less general property, which characterizes many continget is 
t escribed, is that whi 5 Y. o 
interest for the factorist : (see 6, pp. 68-70, "i E 5.23) ИИИ 
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In either case V will be the diagonal matrix containing latent roots of both R; and R,, and 
Land P will be the latent vectors of R, and Rọ respectively. То obtain appropriate equations 
for calculating P from M, say, W’M = P, we have L'M = L'LVÀP = VIP. 

Hence =V-+L'M; (vii) 


This suggests that we may put W’ = V3 = VIF. 
Values for V and F can be computed by following the usual procedure. By hypothesis the first 
row of P will be a set of hypothetical measurements, one fer each person, and will be so calculated 


that, when multiplied by the first column of the weighting matrix F, the product will be an n X m 
hierarchical matrix giving the closest possible fit to M. Let f be the first column of the weighting 
f and v leads to the introduction 


matrix. The familiar method for solving the equations to obtain ‹ 1 ti 
of v as a Lagrange multiplier. The conditions for minimizing the squared discrepancies then yield 


an equation of the well-known form 
f'(&—v0-0, (viii) 


which will only be satisfied if 

|R- vI| =0. (ix) 
To obtain the first factor, we take the largest value for v, insert 
alues for v in order of magnitude, we 
ated by the iterative method known 


The solution yields r values for v. 
it in equation (viii), and solve for f’. Then, taking the other v: 
find the remaining rows for F’. In practice the values can be calcul: 
as * weighted summation.’ 

Interpretation of Factorial Results. 
closely resemble those proposed by Pearso 


It will be seen that the calculations I have described 
n for analysing a correlated set of quantitative 


traits, such as height, arm-length, leg-length, etc. But, since our starting point has been a 
table of frequencies rather than a table of measurements, we cannot press the analogy too 
far without first considering in greater detail what precisely is the nature of the results thus 


reached. т 
Equation (iv) may be interpreted by saying that the. modified matrix of observed data 

(M = D-! M,) has been expressed in terms of two factor-matrices : (а) a matrix of uncorrelated 
factor-measurements for persons, P, so chosen that the first factor will account for the maximum 
essive factor will account for a 


amount of the variance exhibited by the initial matrix, and each succi а f 
maximum amount of the variance exhibited by each successive residual matrix ; and (b) a matrix of 


‘uncorrelated’ factor-saturations for the several traits, giving the weights to be. applied to each 
factor in reconstructing the observed trait-measurements. If we prefer to consider ourselves as 
factorizing the unmodified matrix of observed data, F, we can write equation (iy) in the form 
M, = (D'F)P, and DIF = D'LV! will now represent the factor-saturations or weights ; but the 
columns of factor-saturations (DF) will no longer be uncorrelated. — е 
Moreover, if in equation (viii) we substitute for R: from equation (iii), we obtain 

Г M, MY Di—v1)-20. 
We can simplify this by writing f" D^ = и and post-multiplying by D!. We thus obtain 

w (M, Му — v D) = 0, that is, w Mı Му — Y wD. (xi) 
Ww i i ation will be obtained by taking w = [1, 1, ... 1] = Ww 
Gir nanese dhat one atate SE od “г wa My =[т,т,... т] (i.e., the sums of the several 
columns in Table 1) = тт; and Wo" Mý = (Ni, Ns... Na] (i.e., the sums of the several rows of 
the same table) = w,/D. We thus obtain 
uM,M/=m Wo’ Му = m wo’ D=vu' р, 

proportionate values for the 


{ ү" + a i dure will yield only the 
In accordance with equation (xi). This proce у But since the elements of /’ 


latent vectors, in this case f” = Wo Di = ГУ Мз VNa s+ 81: nt: 
are e of f A and /' | — 1, We can first calculate values of /" by normalizing the 
above values for f’. This gives 


T Му Ns Nn , since N; 4+ Not.+++ № = №. 
ro B teh Е 


Now f’ = yi [ = 4/ m.l’. Hence we can at length obtain appropriate values for the saturations 


namely, 
7 M [№ ^| : (xii) 
i -| SEN ENN 


i a i forms 

1 thus reached for the saturations con 

Aprea requirements can be formulated quite simply. 
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if the condition specified is fulfilled, then (i) in the case of qualitative data all frequencies up to those 
of the order included in the product-sum matrix must have real and positive values, (ii) in the case 
of quantitative data no partial correlation can exceed the numerical value of 1, and (iii) in either 
case the tables can be analysed in terms of real factors. It will be sufficient to indicate quite briefly 
the general lines of proof for these three statements. 


(i) Higher Frequencies. Since the square-sums of frequencies are necessarily larger than their 
product-sums, the appearance of a negative principal minor would imply that some of the frequencies 
had negative or imaginary values. If the product-sum matrix includes only frequencies of the secon 
order, the condition as stated is necessary but not sufficient. To show that the ultimate frequencies 
have real and positive values, it would be necessary to introduce composite attributes. А 

(ii) Partial Correlations. No generality will be lost if we suppose the correlation matrix ог 
product-sum to be arranged so that the rows and columns for the two correlated variables shall be 
the last. Any variables which are neither to be correlated nor to be partialed out will be deleted from 
the matrix. Then, the partial correlation, 

SS xiii 
V Da-na-n М Dan? ` enn 
where D, the determinant obtained from R after the irrelevant rows and columns have been deleted, 
will be a principal minor of R. Now, if 1 > r?, evidently Dm-in-1) Dun must be > D-1 + 
that is 


Fin- 19.12...(n-9) = r (Say) = 


Da-»0-n, Din-i t 1 
= non-negative. 3 И . (xiv) 
Din-i ‚ Dun | 
-1 
Now D=| Duws-1,20-0 | | Do-nw-n; а-н "EN" . (xv) 
Dorin s Din 


where the divisor must be positive. Hence, if D (the principal minor) is non-negative, then the 
expression on the left of (xiv) must be non-negative, and we have r?< 1; if D is negative, then 7° 7, 
(iii) Réal Factors. Now, whatever factorial methods we adopt to reduce R to a diagonal matrix, 
V say, we can split this diagonal matrix, and so further reduce R to the form R = F I F’, where 
F-LV!. But this further process will in general be non-rational ! ; and if any of the non-zero 
elements in V (and therefore in /) are negative, V! and consequently F will involve non-real elements. 
But, by Sylvester's law of inertia, however we reduce A to the form F I F’, the number of negative 
values in / must remain the same. Therefore, if either of the methods of factorization described above 
yields imaginary elements for the factor matrix F, then any method must yield such elements :, that 


is, it will be impossible to analyse R into factors with real elements. Conversely, if our criterion 15 
fulfilled, then А will be reducible to terms of real factors. 


We may therefore sum up our conclusions regarding the table of product-sums 
as follows : in order that an n x л table shall be (a) self-consistent and (b) analysable 


in terms of r real factors (r < п), it is at once necessary and sufficient that all its 
leading principal minors shall be non-negative. 


Iv. A FACTORIAL ANALYSIS OF PHYSICAL 
ATTRIBUTES 


The Neglect of Qualitative Traits in Studies of Body Types. To illustrate the general 
procedure proposed may I now describe its application in a simple concrete case ? 
various points in my argument will perhaps be most readily grasped if I choose a problem 
from physical anthropology. The characters in question will then be easier to visualize + 
less doubt is likely to arise about which characters are genuinely qualitative ; and at the 


same time it will be possible to show how itati х in the 
T Ч quantitative measurements тау be included in 
same series and subjected to the same general treatment. Ld 


Recent discussions about the relation between temper: а hysical character- 
istics have been limited almost entirely to those physical verbere ees like height 
and weight, may be treated as continuously graded, quantitative variables ; they have tende 
almost entirely to overlook the possible relevance of qualitative characteristics, such as 
eye-colour, hair-colour, or peculiarities in the shape of certain features, like the nose ог chin 
(cf. 10 and refs.). Ina later paper I hope to examine what verifiable support there may be for 
popular and traditional notions about the value of such traits as a clue to individual character. 
Here my purpose is primarily to illustrate a practicable techni i f ressing, in terms o 

factors ' or type-tendencies, any signifi UE гог ERP ч 


cant iati Т g the 
physical traits themselves. associations that may be observable among 

1 Cullis, C. E., Matrices and Determinoi а è 
footnote 1. eterminoids, YI, p. 293, Theorem III, and Factors ofthe Mind, p- 307 
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The Data. nan early investigation which I commenced some years ago, Т endeavoured 
to determine more particularly whether there was any evidence for such associations in the 
case of those traits that were commonly supposed to characterize certain * racial" types 
(cf. 7, pp. 23, 27, and refs.). For this purpose I collected observations on the characteristics 
most frequently used in classifying individuals into such groups, namely, eye-colour, hair- 
colour, stature, shape of head, shape of nose and chin, and type of hair. Неге it will be 
unnecessary to defend or define the various terms adopted: as regards stature it will be 
Sufficient to say that persons were classed as ‘ tall ’ if their height was above the average for 
the general population ! and ° short > if it was below the average. Under ‘ wide’ heads I 
here include all with a cranial index over 75 (that is, mesocephalic heads, as well as dolicho- 
cephalic in the stricter sense). To shorten the table, I have also pooled certain traits and 
omitted others. 

The data were collected at Liverpool, a district where representatives of different nationalities— 
Welsh, Irish, Scots, as well as foreigners—were easily accessible. In most cases temperamental 
assessments were obtained at the same time ; but these will also be omitted from the present tables, 


as they raise somewhat different issues. In all, 217 individuals were examined, about two-thirds 
of them males. But, partly to simplify the calculations and partly because the later observations 
nalysis to the data obtained from the last 


were rather more trustworthy, I shall here restrict my а: 
hundred males in the series. 

The Crude and Standardized Contingency-Tables. The number of persons characterized 
by the several attributes specified, taken in pairs, are set out in Table II (this may be taken 
as representing the initial type of contingency-table which was designated C,on p. 172 above). 


TABLE Il. OBSERVED FREQUENCIES 
Number of Persons exhibiting Characteristics Specified 


Trait F RD |T| L M B | Tot. N W |To.| T S | Tot. 
Hair 

Fair .. 22 0 0 22 14 6 2 22 14 8 22 13 9 22 
Red .. 0 15 0 | 15 $ 3 рур ЕЕ Е 5 | 15 
Dark .. 0 06 | 63 | и 25 27 | 8 44 19 | 63 | 20 43 | 63 
Total — .. 22 15 63 |100 | 33 36 31 100 | 69 31 |100 | 43 57 | 100 
ur 3 | 97 6 | 33 | 29 4733 
Light .. 4 si | 33 | 33 0 0 3 

Mixed .. Г: 5 25 | 36 0 36- 0 | 36 | 20 16 | 36 | 10 26 | 36 
Brown .. 2 292 31 0 0 31 31 22 39 31 427 31 
Total ы 22 15 63 |100 | 33 36 31 100 | 69 31 |100 | 43 57 |100 
Heap 4 69 | 69 о | 69 | 30 39 | 69 
Na 27 20 22 

Wide. E: п 19 EH 6 16 9 | 91 0 31 | 31 | 13 18 | 31 
Total я 22 15 63 |100 | 33 36 31 |100 | 69 31 | 100 | 43 57 |100 
STATURE 

Tall . 2 43 | 29 10 4 аз | 30 13 | 43 | 43 0| 43 
Short .. B 0 2 51 4 26 27 | 57 | 39 18 57 0 57 | 57 
шы еъ © Sb SEE ESL. 69 31 |100 | 43 57 [100 
and Wales (the means were taken 


‚ By the : tion I mean the population of England à : 
пою he general рори о Committee of the British Association, The am t ЖО рз, Pp. 
Sf). Th iti f the set of traits originally use also, fo t 
the same асе The original classification of hair- and SEO): pes Some Ei 
claborate than that shown in the present tables (cf. 11, P- 150, 12 and re! 5); ple i 
small to i 5 fine subdivisions. . А d s 
оа чет үр the four characteristics set out in тые H will e E 
[Eee has observed, “ the classical anthropological be quce Rib epi PEU be 
ased almost entirely on the cephalic index, stature, and hair © А ШЕЕ S 
doubted that ШЫГУ ры clear distinctions between the major groups in the continent as a whole. 
( E of Central Europe, 1939, p- 135.) 17 
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Я i he 
in by dividing first the rows of Table II, and then the resulting columns, by t 
pU of Че mutter in the corresponding diagonal cell (the number a Hu 
characterized by each of the eight attributes). _ This converts the number іп eac. n jaana 
ceil to unity. The resulting figures are shown in Table III, which corresponds to the › 
R,,p.172. This is the matrix that we shall now proceed to factorize. 


TABLE Ш. RELATIVE FREQUENCIES 


Sr 


ATURE 0 
Tall .. ore 423 -394 -384 7710-254 -110 *551 :356 | 1:000 Ma 
Short e | "254 171 718 092 -574  .642 | .622 -428 ۰000 1:00 

| 


Factor Analysis of Relative Frequencies. The saturations for the first factor, it will 
be remembered, can be determined by simply 


dividing the square root of the number si 
persons characterized by each attribute (Ni) by the total number (N = 100). The Eae 
so calculated are set out in the first column of Table VIIA. As may easily be verified, the 
factor-variance, i.e., the sum of the Squares of these values, is m = 4. The reader may 
find it instructive to check the fact that t 


he saturations correspond to the first principal 

component, by multiplying each row of Table Ш (or, say, the first two or three figures fr em 

each row) by the corresponding saturation, and dividing each weighted sum by the factor 

variance, namely, 4. The results should be equal to the factor-saturations employed. -— 

A hierarchy is now constructed by multiplying cach saturation in turn by all the satura 
lions. This is subtracted from Table II ; 


and the residuals so obtained are given In 
Table IV. 
TABLE IV. FIRST RESIDUALS 
Trait E PIE M EIN CIT S 
HAIR. | | 
Fair —182 —-372 250 —-068 —-185 
Red.. 


:850 —-370 137 —017 —-123 


Dark —370 .370| —215  .049 :169 


STATURE 
Tall .. 
Short 
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A minor peculiarity in the three foregoing tables deserves a word of comment. In the diagonal 
submatrices of Tables II and III, all the values except those in the leading diagonal are zero, and 
P us “* all the frequency is concentrated in the diagonal compartments." ! This is plainly due to the 

act that all the alternative determinates under one and the same determinable are, by hypothesis, 
mutually exclusive : that being so, of the 22 persons who are entered as having fair hair none can 
also be entered as having red or dark hair. In the corresponding cells of Table IV the negative 
residuals indicate that, after allowing for the effect of the first or general factor, we find that any 
sendeney to grow fair hair inevitably has a negative effect on the tendency to grow hair of any other 

The next step is to test the residuals for significance. For this purpose we must obviously 
confine our examination to those residuals resulting from figures that have been actually 
observed, i.e., those appearing in the non-diagonal matrices, relating to different determin- 
ables. If the reader calculates the sum of the squares of the figures in each of the side sub- 
matrices in Table IV, and then multiplies that sum by N = 100, he will find that the results 
agree exactly with the figures obtained for chi-squared when calculated in the usual way 
from the corresponding submatrices in Table I. The six values thus obtained are shown 
in Table V. АП except those for the association of head-shape with hair-colour and stature 
are significant. The total value for chi-squared comes to 78:35, which, with 13 degrees of 
freedom, is highly significant. 

TABLE V. CHI-SQUARE TABLE VI. POINT-CORRELATIONS 


Deter- HAIR Eyes HEAD STATURE Deter- Hair Eyes HEAD STATURE 

minable minable 
Hair .. — 21:26 0:45 9-01 HAIR .. 1:000 386  —:043 :284 
Eyes ., 21-26 — 5:63 41:98 EvES .. :386 1:000 +195 +553 
Heap .. 45 563  — 02 Heap .. | —043 -195 1:000 014 
-01 я ЅТАТОКЕ 284 553 014 1:000 


اا 


arded as expressing partial or specific associations, 


The residuals in Table IV may be reg: 1 
- and the process repeated. Since there are only seven 


They can be factorized in their turn ; 


degrees of freedom, only seven factors can be extracted. All except the last are fully 


Significant. The saturations ° are given in Table VIIA. Most of the factors can readily be 
interpreted. The first is a general factor, i.e., all its signs are positive: as we have already 
Seen, it specifies the general pattern of traits (or trait-tendencies) which characterizes the 
particular sample of the British population we have been examining : in the main they tend 
to be dark-haired, of mixed eye-colour, dolichocephalic, and short rather more frequently 
than tall. The second factor is bipolar: it indicates a contrast between (a) a complex 


tendenc: i i ight eyes with tall stature, fair or red hair, and a narrow head, 
ү hich pomine E t stature with dark hair, brown or (less 


and i псу to combine short а; " > 
athe „opposite gu P ad a broad head : of the two antithetical types light eyes in 
the one case and short stature in the other appear to be the traits which are most diagnostic. 
The third and fifth factors appear to contrast a brachycephalic tendency (and its associated 
characteristics) with a dolichocephalic ; the fourth to contrast the red-haired type with the 
fair-haired ; and the sixth suggests a minor type combining tall stature with dark hair and 
light or mixed eye-colour. 


I do not suggest that the p 
analysis. For one thing, severa 
in factors III and IV, suggest that (il 
factor analysis or a method of subdivided factors wou 
loc. cit., p. 65. 

d by the method of 


icular is given is el a i he best 
articular analysis given 1S either a final analysis or t 5 
1 of the factor-saturations, е.5., those for fair and red hair 
at (if only a slightly larger number was available) a group 
Id yield a clearer picture. Further, the 


* The description is quoted from Yule, 


» а weighted summation. They therefore 
mec dec as preemies SE ЕНУ, the reader, if he wishes, can apply Bartlett’s test 


represent * princi nts.” н д Ч 
re c CEN NEC аата д 
enis iate, since thi r 0 
1 pone ОЕШУ indebted to Mr. Arthur Summerfield for assistance with the 
computations. 
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large saturations (over 0-6) that stand in isolation—for eye-colour in factor II, head-shape E 
factor Ш, and hair-colour in factor IV—tempt us to ask whether we might not secure a tru r 
result if, instead of retaining 1-000 in the leading diagonal, we substituted a reduced figuri 
which would exclude specific factors. But to discuss these further issues would take us too 
far from our immediate task. 


TABLE VII. BIPOLAR FACTOR SATURATIONS 
— A ЫЫ ае جم صا‎ 


A. Analysis of Relative Frequencies | B. Analysis of Point-Correlations 
| Factors Factors 
Traits | Traits | D 
1 Dm IV V VI wr} A B С 
HAIR | | | 
Fair | "469 420 231 —-602 —-100 —-404 -119 | -410 
Red ‘387 -333 —-019 -751 -182—376 -012 | HAIR ۰625 «508 429 — 
Dark | 7794 —411 —-128 — -011 —-030 -422 —-076 | 
Eyes | | 
Light | 574 719 —.164 —104 -042 -084 —:325 | 410 
Mixed | :600—241 -563 227—442 .048 :125 | Eves | 820 —176 —-478 — 
Brown "557 —482 —437 —-138 443 —-138 :201 | | 
HEAD | | | 10 
Narrow | 831 -081— 428 -039—-338—-044 -056 | Heap 448—709 478 ^4 
Wide | :557 —121 -638 —-058 -504 -065 — :084 | 
STATURE з | | 410 
Тап | 656 -619 -010 -047 -120 -335 -241 | STATURE 11 878—429 3 
Short | 7755 —-537 —-009 —-041 —-104 —-292 —-209 | | 


——  ÁÁÓ( MN VO MN ا‎ аы 
Factor Analysis of Point-Correlations. As we have already seen, in dealing with qual- 
tative attributes of a dichotomous type, i.e., in cases where each determinable furnishes only 
two mutually exclusive determinates, a set of product-moment correlations derived from 
fourfold tables will commonly provide the simplest basis'for factor analysis. But it 15, d 
think, easy to show that for data of the present type, where several of the determinables lea 


to a multiple classification, this method is far from satisfactory. However, let us briefly 
examine its application here. . 


In Table II, one of the submatrices (that for Head and Stature) involves a single degree of freedom 
only. For this pair of variables we 


erminates have a point-distribution. Where the submatrices contain more 
> can, if we wish, calculate a coefficient of contingency by Pearson’s formula, A 
better, allow for the increased degrees of freedom, by dividing the corresponding ọ by the fourth Ие 
ofthe degrees of freedom." But it is better still to pool the initial data se ne to secure a fourfold ta 
for every pair of determinables from the outset. In view of the results just obtained we may plausib М 
group fair and red hair together ; for eye-colour the original records make it possible to divide mos 
of those classed above as * mixed ’ either with the * light" or with the * brown eyes ` respectively, ane 
the rest may be divided in due proportion.? We can then calculate point-correlations for the resulting 
fourfold tables. In doing so, I have taken what we may call Nordic traits (fair hair, light eyes, narrow 
head, and tall stature) to be positive qualities. The coefficients thus obtained are shown in Table VI. 
1 This expression has been proposed by A. 


d 
0 A. Tschuprow (8). It appears to have been suggeste 
by the divergence coefficient ’ of Lexis (1) : see also 13, p. 325. Here the coefficients so calculate 
are in moderately close agreement with the Ф 


-coefficients set out in Table VI. В 
* It might perhaps be argued that we could preserve the distinction between fair hair and red hair 
by constructing two sets of fourfold tables for hair-c i 


‘ : 1 

г d nA or not-red (i.e., fair and dark). But both logicas 
considerations and the empirical results make it clear that this alternative procedure only increas 
rations so obtained. 
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These coefficients can readily be factorized ; and the saturations, calculated by si i 
c 1 #8 Ы imple sı 
кешен in Table VIIB. It will be seen at once that, without the more detailed Касы Арта оп. 
the E method previously described, it would be very difficult to arrive at a convincing interpretation. 
s SE factor (A) obtained from the table of point-correlations obviously corresponds, as we should 
ree , with the second factor (II) obtained from the table of relative frequencies: thus factor A 
шене а$ а general factor what factor II expressed as a bipolar factor, namely, the tendency of the 
i r positive (Nordic) traits to go together. Factor D expresses the contrast between (a) the combina- 
us n of dark hair and dark eyes and (5) wide head and short stature shown by the saturations for 
3 ese qualities in factor Ш. Factor C expresses the contrast between (a) the combination of dark 
10 and wide head and (5) the combination of brown eyes and short stature suggested by factor IV. 
actor B apparently expresses the contrast between (a) the combination of red hair and tall stature 
and (c) light eyes and narrow head expressed by factor IV. However, with only four determinables, 
and with unity in the leading diagonal, it might well be maintained that the last three factors mainly 
represent specific factors expressed in bipolar form (see 10, pp. 112f.). 
А It must therefore be admitted that cases of manifold classification—i.e., determinables 
involving more than two determinates—cannot as a rule be satisfactorily treated by forcing 
the multiple determinates into twofold form and then analysing the coefficients derived 


from the resulting fourfold tables. 
TABLE УШ. GROUP FACTOR SATURATIONS 
Analysis of Point Correlations 


| act 
Traits | Easton 
| A B С р 
= | 
HAIR |—043 999 000^  -000 
EYES | -195 -395 457 773 


vs 000 -000 -000 
STATURE | 014 +285 "958 -000 
ا ا‎ 

It is always instructive to supplement a bipolar analysis by a group factor analysis. The 
Patterns exhibited by Tables VI and VIIB, suggest that the most appropriate procedure will 
be that which 1 have described elsewhere for cases in which the variables reveal a * cumulative 
complexity.’ The result is a matrix of ‘completely overlapping ' (as distinct from * partly 
overlapping ’ or * non-overlapping °) group factors (Table VIII). Here the implication of 
the figures is that the first classification should be based predominantly on head-shape ; 
the second on hair-colour, eye-colour, and stature, with greatest emplfasis on hair-colour ; 
the third on the joint variation of eye-colour and stature, regardless of hair-colour ; and the 
last on divergences in eye-colour from that indicated by the preceding classifications. 

The Examination of Ultimate Frequencies. Had we been factorizing a correlation table 
of the ordinary kind for quantitative variables, we should, after making a bipolar analysis 
like that shown in Table VIIA, have proceeded at once to take the bipolar factors as indicating 
the way in which the several traits might be combined into groups, and then use this classi- 
fication as the basis for the analysis into group factors. With qualitative traits a more direct 
procedure is available for determining what groupings are most significant. 

i Y i and modifying his definition slightly so as to include those cases 
isse ihe s t acids i ШЫ, let us term the classes specified by all the determinates that 
could be observed in any case, i.e., the classes of the mth order in the case of m determinables, the 
* ultimate frequencies. As Yule points out, `“ the ultimate frequencies form a natural set in terms 
1 З e factor-matrix so obtained has the form of a 
Prou ta the Minde Da P be evide: re-arranged in the order described in the text 


(Head, Hair, Stature, Eyes). The imp ustrative only : to have any genuine 
value they,should be based on a far larger num " ácm 

2 The an à by Yule (6, p. 11), specifies * attributes ' only Jos mat distin 
Ns ici pop reet е from those which аге pea Dos sii dE 
required roblem with which he chiefly deals, nam У, | : 

(екпе dr аы m case of many determinables, and two determinables only in the case of 


manifold determinates. 8 
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As there pointed out, th 
nt if the traits are a 
lications suggested are ill 
ber of traits. 

; he does not distinguish 
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of which the data are completely given” (ibid., p. 13). With a large number of determinates (such as 


may arise in certain types of test) it would be exceedingly laborio 


us to examine all the ultimate fre- 


quencies in turn. However, in such cases the preliminary bipolar analysis would help us to pick out 
those sets of traits which go together most closely and are likely to form significant mth order combina- 
tions. In the present case both m and the лг are small numbers ; and consequently the total number of 
mth order combinations, namely, л, X п» X ms X ny = 3 X 3 X 2 х 2 amounts to 36 only. Here 
therefore it will be as simple as it is instructive to set out a// the 4th order combinations, and examine 


the significance of each one. 


In Table IX (last column but one) I give the frequencies for each of the 36 composite 
categories. The number expected by chance is shown in the last column. The probability 


TABLE IX. THE DETERMINATION OF SIGNIFICANT GROUPS 


Е a 


ULTIMATE 
HAIR EYES HEAD STATURE FREQUENCIES 
Obs. Exp. 
Fair Light Narrow Tall x 8 21 
» » » Short .. 4 28 
» » Wide Tall 2l 1-0 
ээ n » Short .. 0 ra 
Total 14 153. 
á Mixed Narrow Tall .. 1 23 
» » E Short .. 1 31 
» sx * Wide Tall .. 2 11 
» » » Short .. 2 1-4 
Total 6 7:9 
m Brown Narrow Tal .. 0 2:0 
» » К Short .. 0 27 
» » Wide Tal .. 0 0:9 
» » 1 Short .. 2 12 
Total 2 68 
TOTAL 22 220 
Red Light Narrow Tall .. 6 1:5 
» » „ээ Short .. 0 19 
» » Wide Tal .. 2 0:7 
” » » Short .. 0 0:9 
Total 8 5:0 
5 Mixed Narrow Tall 2 1:6 
» » " Short 1 21 
» » Wide Tall 0 07 
m » s Short 2 го 
Total 5 5-4 
5 Brown Narrow Tall 0 14 
i wid Short 9 1:8 
^ ” ide Tall 0 0:6 
E E 5 Short 0 0-8 
Total 2 46 
TOTAL m 2 me e ўа 5 15 150 
Pc ج‎ 
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TABLE IX (continued). 
ULTI 

HAIR EYES HEAD STATURE а 
Obs. Exp. 

Dark Light Narrow Tall .. т us 9 61 
” » v Short .. РА E 0 8-2 
” нр Wide Tall. zs ке ES 2 28 
» » ә Short .. sie oy 0 $77. 
Total 1 20:8 

» Mixed Narrow Tal .. s ‚4 3 67 
» x = Short .. АГ s4 12 89 
» % x Wide Tall x. 2 3:1 
y 5 "s Short .. 8 40 
Total 25 2277 

» Brown Narrow Tall .. 3 T 1 58 
» " 3* Short .. ud s 19 77 
” 3 Wide Tall .. 3 26 
n к " Short .. 4 34 
Total 27 19-5 

TOTAL 8 63 63:0 
GRAND TOTAL .. 100 100-0 


ba a given person will be at once fair-haired, light-eyed, dolichocephalic, and tall will be 
2 33 69 „ 43  .0215. Accordingly, in a sample of 100 we should anticipate 


100 Х 100 < 100 * 100 ~ pns 
that only 2-15 persons would exhibit that combination. Actually there are as many as 8. 
As the theoretical proportions are small we can take the standard error of the discrepancy 
to be approximately equal to the square root of the expected number, namely 4/2-15. Hence 
in this case the excess over expectation is unquestionably significant. 

A glance down the two columns will show that the following combinations may be 


regarded as fully significant : 
(i) Fair hair, light eyes, narrow skull, tall stature. 
rrow skull, tall stature. 


(ii) Red hair, light eyes, na 
(ii) Dark hair, brown eyes, narrow skull, short stature. 
The following fall somewhat below the level of significance in the present group, but 
were fully significant in the larger group of 217 : 


(iv) Dark hair, eyes of mixed colo! 
(у) Dark hair, light eyes, narrow S 
In the larger group one further combination nearly 


namely : 
(vi) Fair hair, light or mixed eyes, broad skull, tall stature. 
The first two combination rieties of the so-called Nordic type ; 


s are suggestive of two familiar уа | 
th ird i -called Mediterranean type; the fourth of the so-called Alpine or 
the bird i suggestive О recogni d d to result from a crossing of Nordic 
1 h is suggestive of the so-ca 


and Mediterranean types ; th lled Beaker-folk. I may add that it 

was noted at а ime that persons belonging to he first two groups frequently redo IA j 

sucl istri rkshire, where tall, fair- с ; 

ed e же жү їсгап Tied members came rom Ireland (chiefly north-eastern areas). The 

third grou included a large proportion 1 Gs 

dominantly Welsh, and the fifth (a very small group) predominantly Irish ; 
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ur, broad skull, short stature. 


kull, tall stature. 
reached the level of significance, 
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i igi i igi i fifth determinable ; 
from Wales. In my original analysis, place of origin was also included asa termi 
m numbers in the resulting subgroups were too small to yield significant figures. Mincationand 

mixed marriages likewise made it clear that, with the data available, a satisfactory classificatio 
regard to ancestral origin, which alone is really relevant, was scarcely possible. - 
igni inati i i it will be seen that in 
If we regard each significant combination of traits as resting on a factor, it will tg 
the main ihe results thus reached restate, in group factor form, the classification already arrived 
at by the bipolar analysis. It is obvious that we could pass from one type of analysis to the o! 
by rotation ; and this would yield appropriate weights for the group factors if required. 
Finally, let me say that in this short paper I have only been able to touch very briefly on a кү 
of the points raised by the problem I have been discussing and to Sketch, in the barest outline, one 
the many possible solutions. 


V. SUMMARY AND CONCLUSIONS 


1. It is argued that, if factor anal 


ysis is to be adequately employed in problems 
of individual psychology, a rigorous 


statistical treatment is needed for qualitative 
data no less than for quantitative; and it is claimed that, with minor BW v agen ^ 
factorial methods can be applied, as legitimately, as readily, and as fruitfully, wit 
the former as with the latter. 


2. The various techniques available for е: 
and briefly discussed. It is concluded tha 
are assumed to rest on a discrCte point-distr 
for point-distributions (o) 


xamining qualitative data are enumerated 
t, with dichotomous classifications that 
ibution, the product-moment correlation 
will, in general, prove the most useful. 

3. Cases of manifold classification 
involving many mutually exclusive de: 
by previous writers, For these cases, 
as ordinarily calculated, is unsuitable. 


‚ in which there are many determinables, cach 
terminates, have been almost entirely ignore 
it is shown, the point-distribution coefficient, 


4. The general methods here proposed for such cases rest on the assumption 
that factor analysis can be validly adopted with any type of symmetrical positive- 
priate for most purposes proceeds 
to a factor matrix containing one 
sion of bipolar factors. The theoretical number of 
analysed, which will always 
5 er of attributes. It is shown that this mode of analysis 
permits of employing the chi-squared criterion for testing the significance to the 
residuals obtained. - 


S: The group factor method also can be a 
frequencies. But more instructive results can be obtained by examining the ultimate 
frequencies, or (with large tables) that particular selection of ultimate frequencies 
which the bipolar analysis indicates is lik 
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. Kelley, T. L. (1947). Fundamentals of Statistics. 


. Burt, C. (1949). ‘Alternative methods of factor an 


CYRIL BURT 


REFERENCES 


. Lexis, W. (1886). * Uber die Theorie der Stabilität Statischer Reihen,” Jahrb. f. nat. Ók. u. Stat. 


XXVII, 604—48 


. Yule, G. U. (1900). * Оп the association of attributes in statistics,’ Phil. Trans., A, CXCIV, 


257-84. 
Pearson, K. (1900). ‘On the correlation of characters not quantitatively measurable,’ Phil. 


Trans., A, CXCV, 1-87. 


. Pearson, K. (1904). * On the theory of contingency and its relation to association and normal 


correlation,’ Drapers Company Research Memoirs, Biometric Series, І. » 
Lipps, G. Е. (1905). ‘Die Bestimmung der Abhängigkeit Zwischen den Merkmalen eines 
Gegenstandes,' Ber. d. Math. phys. Klasse d. Kön. Süchs. Gesellsch. d. Wissenschaften. 


Yule, G. U. (1912). Introduction to the Theory of Statistics. 2nd ed. London : Griffin. 


. Burt, C. (1912). * The inheritance of mental characters,’ Eugen. Rev., IV. 
I, 199-256, II, 


Зеро A. A. (1918-9). ‘Zur Theorie der Stabilität Statistischer Reihe,’ 
-133. 
London : P. S. King (2nd ed., 1946). 


. Burt, C. (1921). Mental and Scholastic Tests. 
. Burt, C. (1944). * Тһе factorial study of physical types,’ Man, XLIV, 82-6. 
. Burt, C. (1945). ' The relation between eye-colour and defective vision,’ Eugen. Rev., XXXVII, 


149-56. s А 
* Defective colour vision in relation to pigmentation and hair,’ 


Mi -3. 

"NU Cambridge, Mass. : Harvard University 
Press. 

tc alysis, Brit. J. Psychol., Stat. Sect., ЇЇ, 


98-121. 


185 


A NOTE ON SHELDON'S TABLE OF CORRELATIONS 
BETWEEN TEMPERAMENTAL TRAITS 


By A. LUBIN 
University College and Maudsley Hospital, London 


I. Problem. П. Criterion. III. Results. IV. Summary. 


I. PROBLEM 


; еп 

In a recent paper (1) Adcock has discussed the table of tuve es vas d 
temperamental traits published by Sheldon in his book on The Varieties of analysis 
ment (6), and reports considerable difficulty in finding a method of factor a aloe 
which will account for the table as it stands. The correlations are тергей ositive 
in Table I. As Professor Burt has pointed out, the symmetrical pattern o ро tried 
and negative signs is unlike that found in any other published table. Adcoc -olete 
various methods of factor analysis. In each case it proved impossible to cor R 0 
the analysis since it led to illegitimate procedures such as taking the square ТӨӨ е 
à negative number. His own suggestion was that, to explain Sheldon's three fated. 
variables, a complex interaction between four factors would have to be postu len 
“ The four factors ” (described in the closing paragraphs of his article) “ seem to able 
themselves to precisely the sort of interaction which we suspected, and seem cap 
of explaining the unusual form of the correlations.” yes : 

There is, however, a simpler hypothesis. Farlier in his paper Adcock piss this 
“ obviously there ‘is Something peculiar about these inter-correlations.” Wit 


; to 
remark we may readily agree. And the peculiarity is so great that one is forced 
ask whether it may 


es e 
not be outside the bounds of mathematical possibility. eee 

the important question to ask is not: can these correlations be factorized ? but ra 
the prior question : are these correlations mutually compatible ? ldon's 
In the following note, it is proposed to demonstrate that at least three of She а 
product-moment correlations could not be obtained simultaneously from any а 


М ry iti for 
Set of measurements : in short, that the table violates the well-known conditons A 
Consistency. It follows that some at least of his figures must contain err 
arithmetical calculation. 


II. CRITERION 


ted in correlation tables in virtue of emp 
St obtain for any set of inter-correlation Lo 
arly paper points out that, if rj, and пз 97, 
be laid down for r,,: (8, р. 250 and refs. ; cf. 3, р. 

i of ry, and гуз as follows : . 

PS Fas ать VI — п? VI — r19 + @ 


8 " " i Vas? 
regoing equation results from the fact that the partial correlation, 7s: 
cannot exceed +1 and cannot be less than — > 


ela- 
- This is true for any set of product-moment corr е 
tions (cf. 3, р. 330, and 8, pp. 238f). The same conditions, as Burt has Болга ош, сап also be 
derived from the fact that the Squared multiple correlation cannot exceed unity. Yule goes О ore 
prove that analogous conditions obtain for the Consistency of the correlations of between four or m 
А : this can readily be deduced from the fact that 12:31 12:315 , Etc., must lie between 
and —1. 


Such errors ca 
inevitable boundary c 
three or more variables. 
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Correlations between Temperamental Traits 


i i ү ith their 
are concerned solely with the consistency of Sheldon's figures, and not wit 
ee can disregard the positive correlations within each of the three main eroi 
V, S, and C, and scrutinize only the negative correlations between the different group S Др; 
form the problem might be stated thus : wecan say that two things are the opposites сас 9 Н 
when (if ever) can we say that three things are opposites? The answer evidently is t on 2 ie 
opposition is more or less incomplete. If opposition is expressed in quantitative form у 11920.50 
correlation, then (as the formula shows) three negative coefficients, all numerically larger i i io te 
would be impossible. Accordingly, in Table I any negative values well over —0-60 are likely 


highly suspect. | Е 
To obtain a decisive test, partial correlations involving three variables at а = 
were computed for each of the triads in Sheldon’s correlation table that арреагес 
suspicious. The well-known Pearson-Yule formula for partial correlation wa 
employed, namely, 
Fig — ligo 
Tirs x 


=== А i .(i) 
7 М1 =r? V1 — Fas? ( 


III. RESULTS 
Out of nineteen partial correlations computed in this manner, as many as five 


furnished negative coefficients numerically larger than —1. The detailed figures аге 
set out below in Table II. 


TABLE П. PARTIAL CORRELATIONS SHOWING IMPOSSIBLE VALUES 
— ——— M P € 


Correlated Trait Partial 
Traits Eliminated Correlations 
V8, S5 C8 -—18 
V8, S6 C1 — 1:09 
V8, S6 C8 —1-22 
V9, S3 C1 — 1:04 
V14, 53 C1 —1-16 


————— M ا‎ LÀ 


It is evident that, if only one triad were found to give an impossible partial correlation, then 
least one of the three first-order correlati 


ons would be in error. In the present case, the E 
Suggests that, at the very least, three first-order correlations must involve errors of computa iof 
With the data available it is impossible to determine with certainty which coefficients are in fons 
and which may be safely accepted. The large negative values printed by Sheldon for the orrelan ihe 
between $3 and СІ (—071), S6 and СІ (—0:69), and V8 and C8 (0-31) would account fora two 
anomalies. Or again, since the variables V8 and C1 occur three times, and one or other of the 
appears in each of the five cases, mistakes over these two would suffice. :ance. 
It will be noted that no question of linearity, sampling error, normality, homogeneous variance, 
etc., is involved. The matter is simply one of arithmetical accuracy. 
The results emphasize the supreme im 
which include a complete system of check: 
Journal (2) makes it plain that it is not 
inaccurate : errors can of course slip th 


£ us errors will result if one of the standard 
checking methods is adopted. Such methods should be treated as a matter of routin 
with all data intended for publication. 
Finally, 


I wish to thank Mr. A. R. Jonckheere for assistance in the calculations 


made for this paper, and Professor Burt for guidance on several points, including the 
reference to Yule. 
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IV. SUMMARY 


1. Difficulties have been experienced in factorizing Sheldon's table of correlations 
between temperamental traits. It is suggested that this is due to peculiarities, not in 
the underlying factors, but rather in the manner in which the correlations have been 
obtained : the pattern of negative coefficients is quite exceptional, and some of the 
coefficients appear to be arithmetically impossible. 


2. This is confirmed by applying Yule’s criterion," namely, that, in an empirical 
table, no partial correlations can exceed unity. Atleast three of Sheldon's coefficients 
appear to be erroneously computed. 


3. Asa corollary it is suggested that all correlations should, as a matter of routine, 
be calculated by some method which permits of arithmetical checks. 
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the importance of checking the arithmetical 
‘ons and to show that those in Sheldon’s 


1 : ; 
In this article my chief purpose has been to demonstrate 
ts out that with other tables the criterion 


accuracy of the figures appearing in any table of inter-correlati 


table are mutually inconsistent. Professor Burt, however, poin h les t [ 
given above in seston Û) might be satisfied perfectly, and yet the coefficients remain inconsistent. 
For example," he writes (in a personal communication), * а very slight change in Sheldon's table 
would suffice to reduce the partial correlations of the first order to an acceptable size ; and yet, owing 
to the large negative correlations between Sheldon's S’s and C's, it would still be impossible to 
analyse the table in terms of real factors." The more general criterion, that he has suggested elsewhere, 
Consists in showing that all tie leading principal minors are non-negative : when this has been proved, 
the matrix is positive definite (or semidefinite). These minors emerge automatically when the partial 
Correlations are computed by what he calls the ‘ factorial method ’ (cf. Factors of the Mind, p. 307, 
BS Laboratory Notes on Elementary Statistics, 1941, sect. 8): with this method therefore no very 
aborat ati required. А б " Б 
І should libe to a yanii cem ay or noting two errata in my previous article, this ru n 
рр. 95 and 102. On p. 95 the plus sign under the radical should be minus ; on p. 102 in the formula 


for c, read VN for М. 


189 


BOOK REVIEWS 


Human Ability. By C. Spearman and Ll. Wynn Jones. Macmillan & Company. 
Pp. vii + 198. 16s. net. 


This slender volume embodies Professor Spearman's last work. It has peg m 
awaited, and is addressed to all psychologists whether they happen to be intereste dto 
statistical approach or not. Written in collaboration with Dr. Wynn Jones, it ee of the 
comprehensive summary of Professor Spearman’s final views on the factorial stu A The 
mind—its methods, controversies, and results, and their psychological i gum pe EM 
chapters suffer a little from the attempt to compress so much material into so small ^ PE ч 
and at times, in the effort to avoid excessive technicality, the writing drops into PIS ae it. 
metaphor and loose colloquialisms, which often obscure the meaning instead of clari К a 
The manuscript was unfortunately left in an unfinished state ; and, partly no dou always 
result, the text is somewhat inadequately annotated, and the references are not à 
complete or exact. " be allotted 

The problem with which the writers are chiefly concerned is the place to be mewhat 
to ‘group factors. Those who are familiar with recent literature will be so nfused, 
disappointed with the outcome of the discussion, for the views expressed are rather co twenty 
and show little advance beyond the position adopted in The Abilities of Man COD eae his 
years ago. There, it will be remembered, Professor Spearman, while still defen ШЕ 8 
celebrated theory of two factors, at the same time tried to leave a loophole for * excep 
cases,’ should they be finally confirmed. k ; entially 

In the present volume, we are assured, “ the psychological picture still remains ess erst 
thesame." Nevertheless, in the summary to chapter 11 (headed “ Theory of Two Fac encra 
the writers tell us that “ the statistical results of this method . . . consist of (1) а d group 
factor ; (2) an unlimited number of narrow specific factors ; and (3) very few broni Et the 
factors." This formulation is almost identical, word for word, with the statemen! at the 
“ three-factor theory," put forward by Burt over thirty years ago,! which Spearman factor 
time so strongly criticized. But how, the puzzled reader is tempted to enquire, can a tw cepted, 
method yield three kinds of factor ? And, even if “ very few ” group factors are to qe c 
this nevertheless means that the two-factor theory, which postulates general an 
factors only, is no longer adequate. . f supple: 

Elsewhere, it is true, the authors try to set up a distinction between the kind о bilities ’ 
mentary factors they are prepared to admit and the ‘ group factors’ or ‘ special al nto the 
of earlier writers, Group factors, they say, must always depend on the tests put em the: 
battery, and consequently cannot represent innate capacities. Their own facto ly dis- 
prefer to describe as * broad specific factors" which, we are told, ** must be sharp tly) 
tinguished from what have been called group factors," although, a little inconsi* 
they are termed ‘ group factors? in chapter II. lation," 

A new feature in their discussi 1 


‚ Ат Sion is the development of a * Method of Excess pun the 
which is put forward as an alternative to existing group-factor methods. To illus fashi 
procedure, the writers print an artifici i i A 


: 6 -factor 
A à elation table is usually arranged before Burt's group ea to 
method is applied ; and, as he h ments; 
“Jike ' 
0 ; namely, the hi iti elations between - 
traits and the comparatively low positi eiue Dite сот i eat 


15 b t in thel 
: t and criticized by Spearman and Hart JD." 
well-known paper (Brit. J. Psych., V, 1913, p. 57), where they юй to show that it оге 
ween mental measurements, On this point, t opose J 
ged his view. As regards the new method here De cross” 
ormula necessarily assumes that all t ] value 
ho- 
' Educati iliti t оп Psye 
logical Tests of Educobl pud f Gen. “cational Abilities (1917) and the Repor 
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Su fang 1 
empiri abis 9" is rarely fulfilled, even approximately, by the figures that are found in 
ere and elsewhere neither the basic part played by Pearson in developing the product- 
EN the principles of multiple correlation, and the method of principal rd 
ae eee та many constructive criticisms, are ever explicitly mentioned. In an early chapter on 
ер riticisms, and later in the text, objections raised by Pearson and his followers are 
m in s cited, and refutations are offered ; but Pearson himself is never referred to by 
CORTA р his ban (as older psychologists in this country are well aware) dates from a heated 
be len g between the two which arose as far back as 1906. But by now it could surely 
eei . These anonymous references, to say the least, are highly misleading to the younger 
d To the statistical reader the most interesting section will doubtless be that which reviews 
TS RI S the methods of factor analysis developed by more recent workers. Spearman 
based. ly takes first the ° bifactor method ° put forward by Holzinger, since that claims to be 
valid „оп the principles underlying his own two-factor theory : the method he accepts as 
He t in *' certain conditions where the Two Factor analysis ceases to be applicable at all.” 
obi urns next to * simple and weighted summation." Here he makes quite clear his main 
ps ов to Burt's procedures. “ Chronologically even earlier,” he writes, ** came anotber 
ER standing attempt at extending the scope of factorizing ; this was contributed by Cyril 
urt: whereas the older method dealt with every tetrad separately, the newer way took the 
s s." Allsuch attempts, however, he considers to be 
P iens His chief objection is to “ the step whereby the vacant diagonal cells are 
Thi mu (by stopgaps) so as to make the whole table into a solid rectangular block." 
d irdly, he briefly examines Thurstone’s centroid method, and quotes his ‘ fundamental 
рөн * This," he says, “is Burt's formula"; hence much the same criticisms are 
of th . And finally he glances at the method of * principal com onents, in which “ the axes 
Hot E (ботан) ellipsoids correspond to the factors.” This is attributed to Kelley and 
K elling, and no reference is made to the earlier outline of the procedure developed by 
arl Pearson. It is summarily dismissed on the ground that, like the centroid method, it 
prodiices negative as well as positive saturations in the supplementary factors. Burt's 
ipolar factors Spearman appears to accept, at any rate in the temperamental field (p. 60). 
p to now, he says, unlike the group and general factors, “ they have attracted comparatively 
Scant attention : but recently they have come more and more into their proper estate as 
Tesearch is turning to the other side of human nature—feeling and volition.” In the 


Cognitive field he rejects them, arguing that intellectual abilities cannot have a negative 
the fact that for Spearman 


influence on actual performance. This criticism is plainly due to а 
а factor is always а causal factor, never а principle of classification. Consequently, he fails 
to understand that, in the view of the writers he is criticizing, when (say) a non-verbal test 
receives a negative saturation, what is negatived is the verbality, not its influence on actual 
Performance. With Holzinger, he tells us, “the concept of * bipolar factor’ has been. 
greatly extended ” ; and he thinks that “in this sphere of factor we must look to him for 
the fullest development." * Y 
What he calls * indirect methods," such as ‘ rotation’ and the use of * oblique axes, 

are rejected because they too readily permit a subjective choice of whatever factors are 
required by the investigator's personal theories. His own method, he says, has been through- 
Out that of partial correlation, with an external criterion rather than an internal. Taking 
external criteria (such as teachers’ assessments for & or the Binet tests of intelligence) the 
Beneral factor is partialled out by Yule's formula, and the residuum examined for supple- 
mentary factors. This, however, is scarcely factor analysis as now understood, since that 
proceeds primarily by seeking internal criteria derived from the table of inter-correlations 
lemseives. i 
To th fessor Spearman, it will be clear that he had lost none of his 
Statistica] ates ang fere his fehting spirit. To stress too ap the p cs 
9f the book would perhaps be unfair. As а pupil and devote! a um pis 


pearman vidently had a difficult task in prep 
manuscript ГА a Ce vg: i ur nded on the work shows how 


d the labour which he has expe e £ js 
loyally he has carried it out. Everyone wi ebted to him for the way In 


Il feel deeply ind i i 
which he has endeavoured to make Professor Spearman's final views ш hy К 
PSychological reader. ^ 
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Psychological Statistics. By Quinn McNemar. | New York: John Wiley & , 
i onion : Chapman & Hall. 1949. Pp. viii + 364. $4.50. 


Dr. McNemar is Professor of Psychology, Statistics, and Education at puntare 
University ; and his text-book is based on fifteen years’ experience in teaching т 
procedures to students of education and psychology. It may be fairly said that it is proba ui 
the best and the most up to date of the many introductory works on the subject that have В 
far appeared on either side of the Atlantic. As Dr. McNemar has anticipated, the їс. 
of his own teacher, Professor Т. L. Kelley, is frequently discernible, and, as a result, pes 
Professor Karl Pearson, under whom Professor Kelley once studied. This perhaps ma s 
its chapters specially suitable for psychological students in this country, since so many pon Я 
and papers published by British psychologists have been, and still are, profoundly indeb dr 
to Karl Pearson's contributions. At the same time, the newer methods introduced by ү 
writers, particularly by Ог. McNemar's other teacher, Professor Hotelling, are almost equally 
in evidence. m 

The manual is designed as a one-year course for senior and graduate students. Exc! е 
for two occasions on which the calculus is used, only an elementary knowledge of mathematic 
is presupposed. The purpose is not so much to provide the student with a knowledge ee 
techniques adequate for original research, but rather to enable him to understand the pla 
and use of statistical concepts and procedures in current psychology. hd 

Three chapters are devoted to the analysis of variance and covariance. Here v 
presentation seems decidedly superior to that adopted in most other elementary text-boo Wy 
but its place in relation to the design of experiments might perhaps have been more rd 
indicated. Further, no indication is given of the peculiar limitations of psychological ү я 
which so often prevent techniques like chi-square or analysis of variance from being ta е 
over as they stand, and applied to problems of psychological research. Throughout the 
volume considerable space is, given to demonstrative treatments of those topics in which t Ü 
non-mathematical thinking is almost as difficult as the mathematical, for example, problem: 
of sampling, and the notions of degrees of freedom and of interaction. 


Nevertheless, the pure statistician, who has always been somewhat critical of the kind of 
statistical procedures favoured by psychologists, will probably have no difficulty in detec ain 
here and there occasional inadequacies and even a little confusion in the discussion gren 
logical problems. The usual distinction between large and small sample methods, whl ^ 
most statisticians would now consider somewhat arbitrary, is preserved. Possibly o s 
they seem simple to the beginner, the methods of analysis appropriate to large samples Ads 
described first ; but it is not made clear that these are really limiting cases of the metho | 
appropriate to small samples, which are statistically more fundamental. Partly asa ro 
of this approach, the questions of sampling, estimation, and the testing of statistical hypo”. 
theses, seem at times to get confused. Thus, in the introduction, the need for methods 0! 
Statistical inference is said to arise “ because of certain inadequacies of research data t 
and the effect is to suggest that the major problem is one of simple estimation. The accoun™ 
of * confidence’ in connexion with statistical inference is restricted almost entirely to t à 
estimation of the population mean ; and there is no discussion of the more general problem 


of statistical inference as they relate to experimentation. The null hypothesis is introduce 
merely as “a better rule-of-thumb procedure ” ; and it is elsewhere stated, perhaps à lit 
casually, that “ chi-square 


" " i n 
; 5 d can likewise be used to tést the null hypothesis.” The discussio" 
of experimental design makes only an incidental appearance. 


о eke pea argel i rief and somewha 
out of date: its limitations are Stressed, but Gp ааа and no hint 
is given of the recent work of Kendall on the su 1 
closely related to methods 
psychological research, it is perhaps a litt 
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and the psychological reader feels at times that the book is not so much an introductory 
review of psychological statistics as a means of introducing the student of psychology to 
techniques which have for the most part been developed in non-psychological fields. 

The book makes a determined attempt to formulate the precise assumptions underlying 
the use of the various procedures as they are described, and to specify the conditions in 
which each method may be validly applied. This is well carried out in the discussion of 
correlation ; but (in the reviewer's opinion) it is less successful in the explanations of the 
chi-square tests. The account given of the procedure to be followed in testing the difference 
between variances of independent random samples is not as complete or as exact as it should 
be : since tables of the variance ratio are, as a rule, appropriate only for the one-tail test 
required in analysis of variance, it is stated that for the present case the tabled probabilities 
should be doubled ; but the author omits to point out that this procedure is only justifiable 
when the two samples are of equal size, nor indeed does he sufficiently bring out the importance 
of considering whether the assumption of homogeneity of variance is justified in particular 


cases. These, however, are blemishes which could perhaps be rectified in a later edition, 
A. S. 


Experimental Design in Psychological Research. -By Allen L. Edwards. New York: 


Rinehart & Co. Inc. 1950. Pp. xvi + 446. 85.00. 

Any text-book seeking to expound statistical procedures for psychological students 
has nowadays to keep a double aim in view. First, it should describe the more important 
techniques which are accepted by professional statisticians and which appear appropriate to 
the types of problem common in psychological research, even though such techniques have 
but rarely been adopted by psychologists. But secondly it must also explain those statistical 
methods which have in fact been in frequent use among psychologists themselves, and are 
therefore likely to be encountered in the books and periodicals consulted by the psycho- 
logical student. If such methods have not yet been recognized by the professional statistician, 
or if they are now giving way to more rigorous procedures, these are not reasons for ignoring 
them altogether, but rather for explaining their imperfections. 

This double task did not confront the earlier writers of such text-books, such as 
Thorndike, Kelley, or William Brown. The mathematical methods developed by Fechner and 
by the Galton-Pearson school were elaborated largely with psychological problems in mind. 
But most of the newer techniques in statistics have arisen out of work in very different fields. 
As mathematical devices they may be admirable in themselves ; but too often they depend, 
explicitly or implicitly, theoretically or practically, on assumptions which may be valid 
enough in the field for which mee originally лавы, but are not always appropriate 

а і i sychologist has to deal. А i 
Р НН eg mehr cum] that an illustration is perhaps desirable. In a recent 
study of tests applied to normal and neurotic groups (the latter саша into various 
subgroups) the investigator explains that he has applied the sape ind of analysis as is to 
e forgets that the distinction 


be found in Snedecor’s chapter on the analysis of variance. tinction 
between the normal and the neurotic is by no means so clear-cut as between ' King 
that, had the measurements furnished by his 


and ‘Standard’ varieties of apples, and that, ie measurements Purni tied Hy e 
i ted, the reliability wou ave be г. і 

КОШЕ КП e. s lur tthe weight or the quantity of fruit. The critical psychologist 

ity of the figures oi rutt, esr 


| whether the differences 1 
ste tet ea the E. borrowed, and whether they do not render these more refined 


» :Reati d 
i i i How far, for example, do the classifications an 
working methods unnecessarily laborious. TOW " ampli a сс үн 
i logist yield anything p ў ral 
the experimental procedures. ae ments” cd we erp Р P vin d 
x 1 yself share A 
, "ker. t suggest that 1 my are t V 
"xp an ur rd T i Edu a statistical psychologists in this country so 
, y, dou 


E approach, f f the current text-books on statistical 

ira s they are, very few 0 [ titel 
pushes aie tone pes difficulties. In his latest үш rote risen e has 
fulfilled the first of the two aims I have mentioned Fond he sects almost wholly to overlook. 
doubt it is the more important of the two. 90 


itle suggests, NIS genera арргоас ollow: outli Fisher in The Design 

As the tit hi llows that utlined by Fish Т) 

- З А of the book deals with the ar alysis of variance and 
a 


of Experiments. Hence a large р en 
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i and with the planning of factorial designs of varying degrees of complexity. 
Since atio author Кос A [ee deeply he is indebted to the work of British іа оаа 
like M. S. Bartlett, М. С. Kendall, Е. Yates, Udny Yule, Karl Pearson, and Egon Peali M 
as well as Fisher himself. And the number of actual researches on which he is able to d 
in his search for concrete examples and illustrations shows how rapidly the work of su Еа 
writers has begun to influence research-procedures in America. Hardly any, however i T 
taken from psychological experiments in this country. This is not because the worl с 
British psychologists has been ignored, but because, until quite recently, it has so on 
adopted these newer techniques. Yet, if one turns back to some similar volume issued tl ү d 
years ago, one would find that nearly half the references relate to statistical work mnm 
by psychologists in this country. Today how many, among our senior colleagues in Bri В я 
psychological departments, could claim even a nodding acquaintance with “ the nea 
developments in statistical analysis ” due to British statisticians, and how many arrange fo! 
them to be taught to their research students ? d 
Nevertheless, I cannot help feeling that the attractive title chosen by Professor Edwar 

is a little too specific. Except for the fact that the examples are derived almost wholly гош 
psychology, the title might almost as well have been phrased * Experimental Design h 
Statistical Research.’ Comparatively few of the methods described are characteristic x Hd 
typical or commoner kinds of experiment carried out in the psychological laboratory. Th 
special statistical methods developed and used so persistently by psychologists themselves- 
for example, the constant process, the ranking method, the so-called discriminant metho! 3 
and factor analysis in its various branches—are not mentioned at all. Formerly Statistica 
were apt to retort, with Professor Filon, that “ such devices are so amateurish that they shou 
be discouraged rather than encouraged." Yet Kendall has shown that ranking procedures 
can be established on a sound basis; Cramér's text-book includes references, not only is 
Pearson's method of principal components, but also to factor analysis by name ; while 


Fisher's own books include discussions of discriminant functions and the treatment of ШЕ 
m Tesponse curve by the probit method (i.e., what psychologists know as the consta 
process). 


,, I think, too, it is a little unfortunate that the treatment might at first seem to imply that, 
ifa psycho. ogica research is to be well i 


ir to criticize an author for not discussing topics which he, аш 
not intend to take up. Professor Edwards may well reply that, at any rate for America 
students, these other matters are already sufficiently covered, and that his special puree 
was to fill a gap that still needed filling. In his preface he explains that his primary object 
was “ to present to the student in psychology, education, sociology, and the behaviour e 
Sciences some of the newer in statistical analysis, particularly with repe 
problems of research and experimentation 
pose, then it will be agreed that it has bie 
) ountry who have already assimilated the 0 
hniques, perhaps by the aid of Professor Edwards’ previous volume; 
there could be no better * newer techniques." C. B. 


The Backward Child. By Cyril Burt. London: Univer: n Press. Third 
Edition, 1950. Pp. xx +704, 2з. Чен GP Lisa 
Sir Cyril Burt's book is concerned primarily with the causes and the treatment of back- 
wardness among school children ; but it provides at the same time an admirable introduction 
to the study of individual differences among children of school age. Its chapters present 
in systematic form the results of a long series of investigations carried out by himself and by 
teachers working under his directi h 


to incorporate further knowledge accumulate 
ole vol 


isti > ; ume has now b Пу revised 
statistical sections considerably expanded. een carefully 
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In his earlier work on The Young Delinguent two new methodological features were 
introduced. The first consisted in an attempt to combine the case-study method with the 
statistical : the intention here was to use the quantitative approach to check the qualitative, 
and the qualitative to interpret the quantitative. The second consisted in the decision to 
carry out a parallel study on a control group, side by side with the study of the crucial group. 
The same two principles have been adopted in the investigation of educational backwardness. 

In both cases the scheme underlying the examination of each child follows the same 
comprehensive lines, and is itself based on the results of prior statistical studies carried out by 
factorial methods. This is the portion of the work that has most frequently excited criticism, 
particularly from those who favour a different view of * the factors of the mind.’ Burt has 
always held that the apparent occurrence of both general and special disabilities among 
school children requires a fairly complex conception of mental processes—one which will 
include group factors as well as general factors. Not unnaturally those who set great store 
by * economy in psychological assumptions " were tempted to reply that one type of factor 
should surely be sufficient. To teachers outside training colleges and to practical psychol- 
ogists concerned with the actual classification of subnormal children any such simplified 
explanations seemed scarcely workable. There was, however, an urgent need for a set 
of crucial enquiries, carried out by someone equally familiar with theoretical and with 
practical needs, along lines which would decide between the chief alternatives. 

In this country, at the time when the volume was first published, Spearman's two-factor 
theory was still dominant. With a few dubious exceptions, so it was maintained, a single 
general ability would account for all individual differences ; and the teacher's notion of special 
disabilities was dismissed as an out-of-date relic of the old faculty theory, which both general 
and statistical psychology had at length discarded. On the other hand, in America Thurstone 
and his followers had introduced the opposite picture: the hypothesis of a general factor 
was rejected, and the existence of group factors alone admitted. A somewhat similar 
standpoint had previously been suggested by Thomson. 

The appointment of an official psychologist working in the London schools enabled 
researches to be planned and carried out on a scale that would have been quite impossible for 
the isolated investigator. The wide and varied data thus brought together in Burt's book 
appears to provide a conclusive answer to the issues raised : both the statistical analyses 
and the examination of numerous case-histories render the double hypothesis indispensable. 
We seem then bound to assume first an innate general ability, resembling what is commonly 
called intelligence, and secondly a number of innate abilities and disabilities in particular 
cognitive processes, such as memory, verbal capacity, and the like. The fact that, despite 
attacks from two opposite fronts, this view has now won increasing acceptance points plainly 
to the great methodological vaiue of joining statistical with non-statistical modes of approach 
in all researches of this kind. Certainly, if one may judge from its fruitfulness in the present 
volume, Burt's hierarchical scheme of factors, including as it does both cognitive and orectic 
tendencies, with the broader or more general factors ramifying into narrower, lends itself 
admirably to the elucidation of the problems of individual psychology, at least so far as they 
arise in work among school children. m n Á-— 

In discussing the place of quantitative methods in investigations of this kind, the author 
insists that statistical psychology is not to be treated as a distinct or self-contained branch : 
the use of quantitative procedures is “ merely an extension or refinement. of the logical 
techniques which are essential to all scientific research and argument.” In the main, there- 
fore, mathematical technicalities have been kept in the background ; and in the text the 
critical ratios, tetrachoric correlations, and factor-saturations are merely cited incidentally 


in the course of the general argument to confirm or disprove the various alternative hypotheses. 

During recent years, however, an increasing number of educationists, schoolmasters, 
and psychologists at child guidance centres have been undertaking statistical investigations 
on educational backwardness and kindred problems. Accordingly, in the present edition, 
the original appendices have been enlarged to supply a somewhat fuller account of the Ve 
useful computing devices and formulz, and detailed references to recent publications nave 


been added. But what should appeal to the statistical investigator most of all is the long 
and suggestive list of problems still 5 


awaiting systematic research. 
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